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LVQ Neural Network Parameter 

Design Value 

Input Signals 

MCC PC AVG 

(PC_CNTL_REF - MCC_PC_AVG) 
(APC_CNTL_REF / ATIME) 

Number of Classes 

9 (one for each operating mode 
defined in Figure 21) 

Number of Output Nodes per Class (N C ias S ) 

8 (for each operating mode) 

Learning Algorithm 

LVQ2.1 

Maximum Number of Epochs (n eps ) 

250 

Initial Value of the Learning Rate ( Xq ) 

0.02 

Window Size (e) 

0.5 

Number of Training Vector Pairs (nuvo) 

500 


Figure 17 
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SSME Signal 
Parameter ID 

SSME Signal 
Parameter Name 

PID40 

OPOV_ACT_POS_A 

PID42 

FPOV_ACT_POS_A 

PID52 

HPFP_DS_P_A 

PID58 

FPB_PC_A 

PID63 

MCC_PC_AVG 

PID90 

HPOPDSP 

PID100 

FUEL_FLOW_AVG 

PID105 

HPFT_DS_T_A2 

PID106 

HPFT_DS_T_A3 

P1D107 

HPFT_DS_T_B2 

PID108 

H P FT_D S_T_B3 

PID200 

MCC_PC_A_AVG 

PID201 

MCC„PC_B_AVG 

PID205 

HPOT_DS_T_A2 

PID206 

H POT_DS_T_A3 

PID207 

HPOT_DS_T_B2 

PID208 

HPOT_DS_T_B3 
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Space Shuttle 
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Model 

Description 

Number of 
Signals 

Modeled 

Operating 

Modes 

Number of 
Process 
Mem Vectors 

Parameter 

Estimation 

Method 

Fault 

Detection 

Method 

LVQ mode- 

17 

StartOI 

150 

MSET 

SPRT mean 

partitioned 

(defined in 

Start12 

150 


pos & neg 

(Model PD) 

Figure 18) 

Start24 

150 





SteadyFull 

150 





SteadyLow 

150 





Upthrust 

150 





Downthrust 

150 



All-modes 

17 

All 

150 

MSET 

SPRT mean 

(Model A1 50) 

(same) 




pos & neg 

All-modes 

17 

All 

300 

MSET 

SPRT mean 

(Model A300) 

(same) 




pos & neg 
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Mode Name 

Operating Mode Criteria 

PREFIRE 

All observations preceding the vehicle start command to 
the engine. Considered a non-operating mode. 

START01 

Controller cycles 0 through 24 after receipt of the engine 
start command. 

START12 

Controller cycles 25 through 49 after receipt of the engine 
start command. 

START24 

Controller cycle 50 through detection of steady-state 
operation (typically at controller cycle -110). The rule for 
transition to STEADY FULL is: If in START24 and if 
|PC CNTL REF - MCC PC AVG | <= (5 * 3.35), and 
PC CNTL_REF >= 2500 transition to STEADY_FULL 

STEADY_FULL 

STEADY_FULL is declared when either: 1 ) the last cycle’s 
state was STEADY_FULL and the commanded PC is both 
unchanged and greater than 2500 psi, e.g., 

| PC CNTL REF -LAST PC CNTL REF | < 3.35 and 
PC_CNTL_REF >= 2500; 

or 2) when the last cycle’s state was a transient state and 
| PC CNTL REF - MCC PC AVG | <= (5 * 3.35) and 
PC CNTLREF >= 2500. 

STEADY_LOW 

STEADY _LOW is declared when either: 1 ) the last cycle’s 
state was STEADY_LOW and the commanded PC is both 
unchanged and less than 2500 psi, e.g., 

|PC CNTL REF -LAST PC CNTL REF | < 3.35 and 
PC_CNTL_REF < 2500; 

or 2) when the last cycle’s state was a transient state and 
| PC CNTL REF - MCC PC AVG | <= (5 * 3.35) and 
PC_CNTL_REF < 2500. 

UPTHRUST 

UPTHRUST is declared when 1) the commanded PC has 
increased since the last cycle, and 2) 

| PC_CNTL_REF - MCC_PC_AVG | > (5 * 3.35) 

DOWNTHRUST 

DOWNTHRUST is declared when 1) the commanded PC 
has decreased since the last cycle, and 2) 

| PC CNTL REF - MCC_PC_AVG | > (5 * 3.35) 

SHUTDOWN 

All observations following the vehicle shutdown command 
to the engine. Considered a non-operating mode. 


Figure 21 




U.S. Patent jui. 12,2005 


Sheet 22 of 36 


US 6,917,839 B2 


0 

O 

o 

O 

o 

O 

o 

o 

o 

u 

o 

o 

n 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


CO 

« 

CO 

CO 

CO 

0 

0 

w 

CO 

CO 

CO 

o E 

E 

E 

E 

E 

E 

E 

E 

E 

E 

E 

s 

> 

00 

o 

OO 


— 

h- 

CD 

OO 

CM 

CM 

00 

00 

GO 

CD 

CD 

o 

o 

CD 

03 

O 

o 

CD 

< 





in 

ID 



L 6 

LO 


O) a 












4? 

vp 


vP 

nP 

o'- 

nP 

vO 

nP 


O" 

vP 

vo 

OV 

00 

N- 

CM 

CD 

O 

T“ 

CD 

CM 

'M" 

CD 

CM 

— o 

o 

, r- 

CO 

CD 

o 

00 

00 

00 

ID 

M" 

o 

to l. 

-M L. 

m- 

CN 

CO 

CM 

M" 

CM 

CM 

CM 

CM 

CM 

oo 

O LU 

d 

d 

d 

d 

d 

d 

d 

d 

d 

d 

d 

»- 












0 












o v> 












> E 












o £ 

CN 

o 

o 

T — 

CM 

o 

T“ 

o 

O 

o 


CJ .2 












c < 












O 












i_ (A 

o ?> 












g = 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 














jS 

3 

CO 

«S| 

CO 

3 

3 

CO 

CO 

3 



CO 

15 

CO 

CO 

15 

<0 

CO 

0 

CO 

0 



T3 

XJ 

TJ 

■O 

*o 

“O 

T> 

■O 

"O 

T3 



■+-* 

j. ■ ■ 

i » 

+-* 


-4— » 

1 ■ 

f 


-*— ■» 



JZ 

JC 

-C 

JZ 

-C 

sz 

-C 

jr 

_£= 

JC 



CD 

O) 

cn 

CD 

CD 

CD 

CD 

.05 

CD 

CD 



d 

5= 

0= 

d 

0= 

d 

d 


d 

d 



To 

To 

To 

To 

To 

To 

To 

To 

To 

To 

42 


c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

CO 

c 

E 

E 

£ 

E 

£ 

£ 

E 

E 

E 

E 

|2 

U 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 


cl 


Z 

z 

z 

z 

z 

z 

z 

z 

z 


'C 

, 

. 

. 

. 

. 

. 


p 




o 

T“* 

CM 

T— 

CM 

— 

CO 

CM 

CO 

CM 

CM 

,p 


LU 

LU 

LU 

LU 

LU 

LU 

LU 

LU 

LU 

LU 

H— » 

0 

r- 


00 

00 

T— 

t — 

CM 

LO 

CD 

h- 

0 

o 

u- 

N. 

h- 

r^ 

OO 

OO 

CO 

CO 

CO 

00 

CD 


o 

o 

o 

o 

o 

o 

O 

O 

o 

O 

2 

00 

CO 

Cn 

cn 

cn 

cn 

cn 

cn 

cn 

cn 

cn 

0 

0 

1 — 

h- 

i — 

i — 

1— 

h- 

d 

i — 

1 — 

d 

s 

H 

C/) 

cn 

cn 

cn 

cn 

cn 

cn 

cn 

cn 

cn 



T— 

CM 

CO 


ID 

CD 

N- 

00 

03 

o 


resi 

No. 

o 

1 

Q 

O 

i 

Q 

p 

Q 

o 

1 

Q 

O 

i 

o 

o 

i 

Q 

O 

i 

Q 

o 

} 

Q 

o 

□ 

i 

Q 



CL 

CL 

CL 

i CL 

CL 

CL 

CL 

CL 

□l 

GL 



CN 

CN 

0 

i_ 

13 

gj 

Ll 
















U.S. Patent 


Jul. 12, 2005 


Sheet 24 of 36 


US 6,917,839 B2 


o 

o 

o 

o 

a 

o 

o 

o 

o 

O 

o 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

c/> 

CO 

CO 

<0 

(0 

CO 

CO 

0 

0 

CO 

E 

E 

E 

E 

E 

E 

E 

E 

E 

E 

6 


00 

o 

oo 

r- 


oo 

oo 

r^ 

CM 

CM 

CM 

CM 

o> 

CM 

CD 

r- 

CM 

N- 

N- 

CM 

O 

id 

id 


LO 



id 



id 

to 


-sP 



-p 

o N 

'V? 

a? 

-p 


-o 

0 s * 

VP 

CD 

00 

CO 

CO 

CO 

o 

O 

O) 

O 

O) 

CM 

LO 

in 

CO 

O) 

CO 

oo 

CM 

T — 


CD 

CM 

m 

1 ^ 

LO 

°0 

CD 



LO 


CO 

ID 

o 

o 

1 o 

o 

o 

o 

O 

o 

o 

o 

C> 


o o> o o o o 


ololol CM 


l_ (/> 

o © 

C^OOOOOO 


o o o o 


0 

0 

0 

0 

m 

0 

0 

0 

0 

0 

03 

03 

0 

0 

to 

0 

0 

CD 

0 

0 

X> 

~o 

~o 

~o 

x> 

"O 

x> 

"O 

“O 

~o 

-+_* 

-H 



*—• 


■ 4 ~* 




JO 

jC 

-C 

JZ 

sz 

jc 

sz 

JC 

JC 

JC 

o> 

CD 

CD 

CD 

o> 

CD 

CD 

CD 

CD 

CD 

Itr 

5 = 

«t= 




It 1 

5 = 

5 = 


0 

0 

"0 

"to 

TO 

TO 

TO 

TO 

TO 

"to 

a 

C 

c: 

c. 

C 

C 

O 

C 

C 

o 

E 

£ 

E 

E 

E 

E 

E 

E 

E 

E 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

z 

z 

Z 

Z 

Z 

z 

z 

Z 

z 

Z 


CM 


CM 

x — 

00 

CM 

00 

CM 

c\i 

LLl 

LU 

LU 

ULl 

LLl 

LU 

LU 

LU 

LU 

LU 


I s *- 

CD 

00 

7 — 


CM 

iO 

CO 

P- 

N. 

I s - 

N. 

N- 

00 

00 

00 

00 

00 

CO 

o 

o 

O 

O 

o 

o 

O 

O 

O 

o 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

1“ 

1 — 

H 

h~ 

1 — 

1 — 

1 — 

1 — 

\— 

1“ 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 

CO 


T- CNJ CO LO CD K 000)0 

OOOOOOO OO'T- 

I I I t I I I III 

OOOOOOO ooo 

lO UO UO UO CO LO lO LOLOlO 

<<<<<<< <<< 



U.S. Patent 


Jul. 12, 2005 


Sheet 25 of 36 


US 6,917,839 B2 


Error at 
Detect 

25.1% 

< 

Z 1 

1 

31 .0% 

V/N 

vO 

o'* 

in 

in 

3.8% 

12.1% 

0 s 

co 

CO 

9.2% 

4.5% 

17.8% 

13.1% 

Time to 
Detect 

125.68 sec 

95.04 sec 

1 54.92 sec 

— 

Missed 

Alarm 

27.40 sec 

18.92 sec 

60.32 sec 

43.24 sec 

46.20 sec 

22.72 sec 

89.04 sec 

65.4 sec 

One Cycl 
Alarms 


CM 

CM 

O 


CO 

..... 


23 

O 

M" 

00 

L ■ — ' — 

Average for All Drift Tests 

S nsor 
Failures 

1 (PID40) 

1 (PID40) 

1 (PID42) 

__ 

0 (Missed 
Alarm) 

1 (PID52) 

1 (PID58) 

1 (PID90) 

2 (PID100) 
& (PID58 
False Alarm) 

1 (PID105) 

O 

o 

CM 

O 

cl 

1 (PID205) 

i 

Test Description 

Drift PID40. 

0.14 pct/sec beginning at 10.0 sec. 

Noise PID40. 

±5 pet random beginning at 10.0 sec. 

Drift PID42. 

0.16 pct/sec beginning at 10.0 sec. 

Noise PID42. 

±5 pet random beginning at 10.0 sec. 

Drift PID52. 

1 1 .9 psi/sec beginning at 10.0 sec. 

Drift PID58. 

10.2 psi/sec beginning at 10.0 sec. 

Drift PID90. 

8.2 psi/sec beginning at 10.0 sec. 

Drift PID100. 

31.9 gpm/sec beginning at 10.0 sec. 

Drift PID1 05. 

3.4 degR/sec beginning at 10.0 sec. 

Drift PID200. 

6.3 psi/sec beginning at 10.0 sec. 

Drift PID205. 

2.7 degR/sec beginning at 10.0 sec. 

ri 


CM 

co 


in 

CO 


00 

ar> 

O 





T — 

T— 

T— 

t— 

T — 

v- 



CM 

CM 


4C. 

O 

o 

6 

o 

o 

6 

O 

6 

o 

O 

O 


"to 

in 

m 

uo 

in 

in 

m 

in 

LO 

m 

m 

in 



^ — 

r* 





X — 

T — 



T — 



< 

< 

< 

< 

< 

< 

< 

< 

< 

< 

< 



Figure 25. Comparative Results 
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Figure 27. Comparative Results 
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X-33 Fault Classification Submodel 
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Partitioned Model Under OPERATING Conditions 
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P(Sensor1= ABNORMAL)= 0.1156 
P(T1 Element = ABNORMAL)^ 0.0077 

HI, LI &T1 Drift High 

Sensorl Fault 

P(Sensor1= ABNORMAL)^ 0.9433 
P(H1 Element = ABNORMAL)^ 0.1363 
P(L1 Element = ABNORMAL)^ 0.1363 
P(T1 Element = ABNORMAL)= 0.1363 


Figure 33 
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Partitioned Model Under VENTING Conditions 

Fault Simulated 

Fault Classified 

Fault Classification Probabilities 

HI Drift High 

HI Element Fault 

P(H1 Element = ABNORMAL) 3 0.6779 
P(L1 Element = ABNORMAL) 3 0.0058 
P(T1 Element = ABNORMAL) 3 0.0058 
P(Sensor1= ABNORMAL) 3 0.0010 

LI Drift High 

LI Element Fault 

P(L1 Element = ABNORMAL) 3 0.6779 
P(H1 Element = ABNORMAL) 3 0.0058 
P(T1 Element = ABNORMAL) 3 0.0058 
P(Sensor1= ABNORMAL) 3 0.0010 

T1 Drift High 

T 1 Element Fault 

P(T1 Element = ABNORMAL) 3 0.6779 
P(H1 Element 3 ABNORMAL) 3 0.0058 
P(L1 Element 3 ABNORMAL) 3 0.0058 
PfSensorl = ABNORMAL) 3 0.0010 

HI & LI Drift High 

HI Element Fault 
LI Element Fault 

P(H1 Element = ABNORMAL) 3 0.6121 
P(L1 Element 3 ABNORMAL) 3 0.61 21 
PfSensorl 3 ABNORMAL) 3 0.1 156 
P(T 1 Element 3 ABNORMAL) 3 0.0077 

HI. LI &T1 Drift High 

Sensorl Fault 

P(Sensor1= ABNORMAL)= 0.9433 
P(H1 Element = ABNORMAL)= 0.1363 
P(L1 Element = ABNORMAL)^ 0.1363 
P(T 1 Element = ABNORMAL)= 0.1363 


Figure 34 
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Unpartitioned Model Under OPERATING Conditions 

Fault Simulated 

Fault Classified 

Fault Classification Probabilities 

HI Drift High 

HI Element Fault 

P(H1 Element = ABNORMAL)= 0.6779 
P(L1 Element = ABNORMAL)= 0.0058 
P(T1 Element = ABNORMAL)= 0.0058 
P(Sensor1= ABNORMAL)^ 0.0010 

LI Drift High 

LI Element Fault 

P(L1 Element = ABNORMAL)^ 0.6779 
P(H1 Element = ABNORMAL)^ 0.0058 
P(T1 Element = ABNORMAL)^ 0.0058 
P(Sensor1= ABNORMAL)= 0.0010 

Tt Drift High 

T1 Element Fault 

P(T1 Element = ABNORMAL)^ 0.6779 
P(H1 Element = ABNORMAL)= 0.0058 
P(L1 Element = ABNORMAL)= 0.0058 
P(Sensor1= ABNORMAL)^ 0.0010 

HI & LI Drift High 

HI Element Fault 
LI Element Fault 

P(H1 Element = ABNORMAL)= 0.6121 
P(L1 Element = ABNORMAL)= 0.6121 
P(Sensor1= ABNORMAL)^ 0.1156 
P(T 1 Element = ABNORMAL)= 0.0077 

HI, LI & Tt Drift High 

Sensorl Fault 

P(Sensor1= ABNORMAL)^ 0.9433 
P(H1 Element = ABNORMAL)^ 0.1363 
P(L1 Element = ABNORMAL)= 0.1363 
P(T 1 Element = ABNORMAL)= 0.1363 


Figure 35 Comparative Results 
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Unpartitioned Model Under VENTING Conditions 

Fault Simulated 

Fault Classified 

Fault Classification Probabilities 

HI Drift High 

HI Element Fault 

P(H1 Element = ABNORMAL) 3 0.6779 
P(L1 Element = ABNORMAL )= 0.0058 
P(T1 Element = ABNORMAL) 3 0.0058 
P(Sensor1= ABNORMAL) 3 0.0010 

LI Drift High 

Not Detected 

P(L1 Element = ABNORMAL) 3 0.1000 
P(H1 Element = ABNORMAL)= 0.1000 
P(T1 Element = ABNORMAL) 3 0.1000 
P(Sensor1= ABNORMAL) 3 0.0500 

T1 Drift High 

T1 Element Fault 

P(T1 Element = ABNORMAL) 3 0.6779 
P(H1 Element = ABNORMAL) 3 0.0058 
P(L1 Element = ABNORMAL) 3 0.0058 
P(Sensor1= ABNORMAL) 3 0.0010 

HI & LI Drift High 

HI Element Fault 
(LI Fault Not Detected) 

P(H1 Element = ABNORMAL) 3 0.6779 
P(L1 Element 3 ABNORMAL) 3 0.0058 
P(T1 Element = ABNORMAL) 3 0.0058 
P(Sensor1= ABNORMAL) 3 0.0010 

HI, LI &T1 Drift High 

HI Element Fault 
T1 Element Fault 
(LI Fault Not Detected) 

P(H1 Element = ABNORMAL) 3 0.6121 
P(T1 Element = ABNORMAL) 3 0.6121 
P(Sensor1= ABNORMAL) 3 0.1156 
P(L1 Element 3 ABNORMAL) 3 0.0077 


Figure 36 Comparative Results 
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SURVEILLANCE SYSTEM AND METHOD 
HAVING AN OPERATING MODE 
PARTITIONED FAULT CLASSIFICATION 
MODEL 

CROSS-REFERENCE TO RELATED 
APPLICATIONS 

This application is a continuation in part patent applica- 
tion of U.S. patent application Ser. No. 09/591,140, filed 
Jun. 9, 2000, now U.S. Pat. No. 6,609,036. 

STATEMENT REGARDING FEDERALLY 
SPONSORED RESEARCH OR DEVELOPMENT 

The invention described herein was made in the perfor- 
mance of work under NASA Small Business Innovation 
Research (SBIR) Contracts NAS4-99012 and NAS13- 
01001, and is subject to the provisions of Public Law 96-517 
(35 USC 202) and the Code of Federal Regulations 48 CFR 
52.227-11 as modified by 48 CFR 1852.227-11, in which the 
contractor has elected to retain title. 

FIELD OF THE INVENTION 

The instant invention relates generally to a system and 
method for fault classification using operating mode parti- 
tioning and, in particular, to a system and method for 
performing high sensitivity surveillance of an asset such as 
a process and/or apparatus preferably having at least two 
distinct modes of operation wherein surveillance is per- 
formed using an operating mode partitioned fault classifi- 
cation model of the asset. 

BACKGROUND OF THE INVENTION 

Conventional process surveillance schemes are sensitive 
only to gross changes in the mean value of a process signal 
or to large steps or spikes that exceed some threshold limit 
value. These conventional methods suffer from either a large 
number of false alarms (if thresholds are set too close to 
normal operating levels) or from a large number of missed 
(or delayed) alarms (if the thresholds are set too 
expansively). Moreover, most conventional methods cannot 
perceive the onset of a process disturbance or sensor signal 
error that gives rise to a signal below the threshold level or 
an alarm condition. Most conventional methods also do not 
account for the relationship between a measurement by one 
sensor relative to another sensor. Further, most conventional 
methods provide no means to assess the most likely cause of 
a process disturbance or sensor signal error. For example, a 
process disturbance could result from any combination of an 
instrumentation problem, an equipment problem, or the 
process operating in a new or unexpected way. 

Recently, improved methods for process surveillance 
have developed from the application of certain aspects of 
artificial intelligence technology. Specifically, parameter 
estimation methods have been developed using either 
statistical, mathematical or neural network techniques to 
learn a model of the normal patterns present in a system of 
process signals. After learning these patterns, the learned 
model is used as a parameter estimator to create one or more 
virtual signals given a new observation of the actual process 
signals. Further, high sensitivity surveillance methods have 
been developed for detecting process and signal faults by 
analysis of a mathematical comparison between the actual 
process signal and its virtual signal counterpart. Moreover, 
automated decision making methods have been developed 
for reasoning about the cause of events or problems on the 
basis of their symptoms as represented in data. 


2 

Parameter estimation based surveillance schemes have 
been shown to provide improved surveillance relative to 
conventional schemes for a wide variety of assets including 
industrial, utility, business, medical, transportation, 
5 financial, and biological systems. However, parameter esti- 
mation based surveillance schemes have in general shown 
limited success when applied to complex processes. Appli- 
cant recognizes and believes that this is because the param- 
eter estimation model for a complex process must charac- 
10 terize the entire operating state space of the process to 
provide effective surveillance. Moreover, a review of the 
known prior- art discloses that virtually all such systems 
developed to date utilize a single model of the process to 
span the entire set of possible operating modes. Hence, a 
15 significant shortcoming of the known prior-art is that, inter 
alia, statistically derived models become extremely large 
and neural network models become difficult or impractical 
to train when the process operating state space is complex. 
The implication for statistically derived models is that the 
20 parameter estimation method and system becomes compu- 
tationally expensive to operate thereby limiting the utility of 
the method for on-line or real-time surveillance. An alter- 
native for statistically derived models is to constrain the size 
of the model; however this constraint limits the accuracy of 
25 the parameter estimation method and thereby limits the 
sensitivity of the surveillance method. The implication for 
mathematical and neural network models is simply that the 
parameter estimation method and system becomes less accu- 
rate thereby degrading the sensitivity of the surveillance 
30 method. These shortcomings in parameter estimation and 
the dependent capability for fault detection also reduce the 
utility, performance and benefit of automated decision mak- 
ing methods. Further, automated decision making itself 
becomes a much more complex and less reliable procedure 
35 when the process operating state space is complex. Auto- 
mated decision making when the process operating state 
space is complex often leads to conflicting and incompatible 
decision objectives and fault patterns when considering 
multiple operating modes of the process. In fact, automated 
40 decision making when the process operating state space is 
complex can become combinatorially infeasible to accom- 
plish with the reliability and confidence needed for practical 
use. 

Many attempts to apply parameter estimation, fault 
45 detection, and fault classification techniques to assets such 
as industrial, utility, business, medical, transportation, 
financial, and biological processes have met with poor 
results in part because the techniques used were expected to 
characterize the entire operating state space of the process. 
50 In one example, a multivariate state estimation technique 
(MSET) based surveillance system for the Space Shuttle 
Main Engine’s telemetry data was found to produce numer- 
ous false alarms when the learned MSET parameter estima- 
tion model was constrained to a size suitable for on-line, 
55 real-time surveillance. In this case, the surveillance system 
false alarm rate could be reduced by desensitizing the 
surveillance threshold parameters; however, the missed 
alarm rates then became too high for practical use in the 
telemetry data monitoring application. In another case, a 
60 Bayesian belief network fault classification system for the 
X-33 Single Stage to Orbit Demonstrator vehicle was found 
to classify fault indications incorrectly when multiple oper- 
ating modes of the system were represented in a single 
decision model. 

65 Moreover, current parameter estimation, fault detection, 
and fault classification techniques for surveillance of assets 
such as industrial, utility, business, medical, transportation, 
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financial, and biological processes fail to recognize the 
surveillance performance limitations that occur when it 
becomes necessary to trade-off decision processing speed 
against decision accuracy. This may be attributed, in part, to 
the relative immaturity of the field of artificial intelligence 
and computer-assisted surveillance with regard to real-world 
process control applications. Additionally, a general failure 
to recognize the specific limitations of trading off decision 
processing speed against decision accuracy for computer- 
assisted surveillance is punctuated by an apparent lack of 
known prior art teachings that address potential methods to 
overcome this limitation. In general, the known prior-art 
teaches computer-assisted surveillance solutions that are 
either applied globally to all operating modes of an asset or 
applied only to a single predominant operating mode, for 
example, applied only to steady state operations while 
neglecting all transient operating states of the asset. 

For the foregoing reasons, there is a need for a surveil- 
lance system and method that overcomes the significant 
shortcoming of the known prior- art as delineated herein- 
above. 

BRIEF SUMMARY OF THE INVENTION 

The instant invention is distinguished over the known 
prior art in a multiplicity of ways. For one thing, one 
embodiment of the invention provides a surveillance system 
and method that partitions decision models of an asset for 
overcoming a performance limiting trade-off between deci- 
sion processing speed and decision accuracy that has been 
generally unrecognized by the known prior art. Additionally, 
one embodiment of the invention can employ any one of a 
plurality of parameter estimation methods, fault detection 
methods, and fault classification methods and the decision 
models used therewith for improving surveillance perfor- 
mance. Furthermore, one embodiment of the invention pro- 
vides a surveillance system and method that provides an 
operating mode partitioned decision model that can be 
accomplished by observation and analysis of a time 
sequence of process signal data and by a combination of a 
plurality of techniques. 

Moreover, one embodiment of the invention provides a 
surveillance system and method that provides an operating 
mode partitioning of the decision model which enables 
different parameter estimation methods, fault detection 
methods, and fault classification methods to be used for 
surveillance within each individual operating mode of an 
asset. This ability enables surveillance to be performed by 
the instant invention with lower false alarm rates and lower 
missed alarm rates than can be achieved by the known 
prior- art methods. 

Hence, one embodiment of the invention provides a 
surveillance system and method that performs its intended 
function much more effectively by enabling higher decision 
processing speed without a concomitant reduction in deci- 
sion accuracy. Conversely, one embodiment of the invention 
alternately enables improved decision accuracy without a 
concomitant reduction in decision processing speed. 
Additionally, these competing criteria may be traded-off to 
achieve the optimal performance solution for a specific 
surveillance application. Furthermore, and in contrast to the 
known prior art, and in one embodiment of the invention, 
parameter estimation methods, fault detection methods, and 
fault classification methods may be individually tailored for 
each operating mode of the asset thereby providing addi- 
tional capability to reduce decision error rates for the 
surveillance system. 
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In one embodiment of the invention, the instant invention 
provides a surveillance system and method that creates and 
uses, for the purpose of asset surveillance, a coordinated 
collection of decision submodels wherein each decision 
5 submodel in the coordinated collection is optimized for a 
single operating mode or subset of operating modes of an 
asset. 

In another embodiment of the invention, an asset surveil- 
lance system is comprised of an operating mode partitioned 
10 fault classification model of an asset comprised of a plurality 
of fault classification submodels each having an asset oper- 
ating mode associated thereto; a fault indication means for 
determining one or more fault indications given a set of 
observed asset signals from the asset; means for determining 
at least one operating mode of the asset for the set of 
15 observed asset signals; a first selection means for selecting 
at least one of the fault classification submodels from the 
operating mode partitioned fault classification model as a 
function of at least the one determined operating mode for 
providing a fault classification of determined fault indica- 
20 tions for performing asset surveillance. The fault indication 
means further includes an operating mode partitioned 
parameter estimation model comprised of a plurality of 
parameter estimation submodels each having an asset oper- 
ating mode associated thereto and a second selection means 
25 for selecting at least one of the parameter estimation sub- 
models from the operating mode partitioned parameter esti- 
mation model as a function of at least the one determined 
operating mode. The fault indication means further includes 
means for processing the observed asset signals as a function 
30 of at least the one selected parameter estimation submodel 
for defining parameter estimated data. Additionally, the fault 
indication means includes an operating mode partitioned 
fault detection model comprised of a plurality of fault 
detection submodels each having an asset operating mode 
35 associated thereto. Furthermore, the fault indication means 
further includes a third selection means for selecting at least 
one of the fault detection submodels from the operating 
mode partitioned fault detection model as a function of at 
least the one determined operating mode. Moreover, the 
40 fault indication means further includes means for processing 
the parameter estimated data as a function of at least the one 
selected fault detection submodel for determining the one or 
more fault indications used for providing the fault classifi- 
cation of determined fault indications by the first selection 
45 means selecting at least one of the fault classification 
submodels from the operating mode partitioned fault clas- 
sification model as a function of at least the one determined 
operating mode for providing the fault classification of 
determined fault indications for performing asset surveil- 
50 lance- 

In another embodiment of the invention, a method for 
determining asset status includes the steps of creating an 
operating mode partitioned fault classification model com- 
prised of a plurality of fault classification submodels each 
55 having an asset operating mode associated thereto; acquiring 
a set of observed signal data values from an asset; deter- 
mining at least one fault indication as a function of the 
observed signal data values; determining at least one oper- 
ating mode of the asset for the set of observed asset signals; 
60 selecting at least one fault classification submodel from the 
operating mode partitioned fault classification model as a 
function of at least the one determined operating mode, and 
using at least the one fault indication and at least the one 
selected fault classification submodel for classifying faults 
65 for performing asset surveillance. 

In another embodiment of the invention, a method for 
determining asset status includes the steps of partitioning a 



US 6,917,839 B2 


5 

decision model into a plurality of partitions, each partition 
having an operating mode associated thereto: employing a 
plurality of different methods from a plurality of parameter 
estimation methods, a plurality of fault detection methods, 
and a plurality of fault classification methods for different 5 
partitions; determining at least one operating mode of an 
asset; selecting at least one the plurality of partitions as a 
function of the determined operating mode for tailoring the 
plurality of parameter estimation methods, the plurality of 
fault detection methods, and the plurality of fault classifi- 10 
cation methods to perform asset surveillance as a function of 
the at least one determined operating mode. 

In another embodiment of the invention, a method for 
determining asset status includes the steps of acquiring a set 
of observed signal data values from an asset; producing a 15 
calculated set of estimated signal data values correlative to 
the set of observed signal data values acquired from the 
asset; comparing the set of observed signal data values to the 
calculated set of estimated signal data values; determining a 
presence of a disagreement between the set of observed 20 
signal data values and the calculated set of estimated signal 
data values on the basis of the comparison step, and deter- 
mining a cause of a determined presence of disagreement 
between the set of observed signal data values and the 
calculated set of estimated signal data values for performing 25 
asset surveillance. The method further including the step of 
performing asset control as a function of the determined 
cause of the determined presence of disagreement. 

In another embodiment of the invention, a method and 
system for determining asset status includes the steps of 30 
creating a fault detection model comprised of a plurality of 
fault detection submodels each having an operating mode 
associated thereto; creating a fault classification model com- 
prised of a plurality of fault classification submodels each 
having an operating mode associated thereto; acquiring a set 35 
of observed signal data values from an asset; determining at 
least one operating mode of the asset for the set of observed 
signal data values; selecting at least one fault detection 
submodel from the fault detection model as a function of at 
least the one determined operating mode; determining at 40 
least one fault indication as a function of the observed signal 
data values; selecting at least one fault classification sub- 
model from the fault classification model as a function of at 
least the one determined operating mode, and using at least 
the one fault indication and at least the one selected fault 45 
classification submodel for classifying faults for performing 
asset surveillance. The method and system of further includ- 
ing the step of creating a parameter estimation model 
comprised of a plurality of parameter estimation submodels 
each correlative to at least one training data subset parti- 50 
tioned from an unpartitioned training data set and each 
having an operating mode associated thereto and wherein 
the step of determining at least one fault indication as a 
function of the observed signal data values includes the step 
of determining at least one fault indication as a function of 55 
both the estimated signal values determined using the 
parameter estimation model and the observed signal data 
values. 

Moreover, having thus summarized the invention, it 
should be apparent that numerous modifications and adap- 60 
tations may be resorted to without departing from the scope 
and fair meaning of the present invention as set forth 
hereinbelow by the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 65 

FIG. 1 is a schematic functional flow diagram of an 
embodiment of the invention. 
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FIG. 2 is a schematic functional flow diagram of a method 
and system for training an operating mode partitioned deci- 
sion model using recorded observations of the actual process 
signals in an embodiment of the invention. 

FIG. 3 is a schematic functional flow diagram of a method 
and system for performing surveillance of an asset using an 
operating mode partitioned decision model in an embodi- 
ment of the invention. 

FIG. 4 is a functional schematic of an operating mode 
partitioned training data set. 

FIG. 5 is a functional schematic of an operating mode 
partitioned decision model having an operating mode par- 
titioned parameter estimation model, an operating mode 
partitioned fault detection model, and an operating mode 
partitioned fault classification model. 

FIG. 6 is a functional schematic of an operating mode 
partitioned decision model having an operating mode par- 
titioned fault detection model, and an operating mode par- 
titioned fault classification model. 

FIG. 7 is a functional schematic of an operating mode 
partitioned fault classification model. 

FIG. 8 is a schematic functional flow diagram of a 
preferred method and system for classifying faults based on 
observed signal values for performing asset surveillance. 

FIG. 9 is a schematic functional flow diagram of a 
surveillance system using an operating mode partitioned 
decision model in an embodiment of the invention. 

FIG. 10 is a schematic functional flow diagram of a 
method and system for an unpartitioned MSET training 
procedure. 

FIG. 11 is a schematic functional flow diagram of a 
method and system for an unpartitioned MSET surveillance 
procedure. 

FIG. 12 illustrates the relationship between the overall 
MSET parameter estimation error and the number of obser- 
vation vectors used in the process memory matrix when 
unpartitioned parameter estimation methods are used for 
MSET training and surveillance; 

FIG. 13 illustrates the relationship between the data 
processing time required for producing an MSET parameter 
estimate and the number of observation vectors used in the 
process memory matrix when unpartitioned parameter esti- 
mation methods are used for MSET training and surveil- 
lance. 

FIG. 14 is a schematic functional flow diagram of the 
training procedure for an embodiment using an operating 
mode partitioned collection of MSET parameter estimation 
submodels in an embodiment of the invention. 

FIG. 15 is a schematic functional flow diagram of the 
surveillance procedure for an embodiment using an operat- 
ing mode partitioned collection of MSET parameter estima- 
tion submodels in an embodiment of the invention. 

FIG. 16 is a schematic architecture diagram of a learning 
vector quantization neural network useful for determining 
the operating mode of an asset in an embodiment of the 
invention. 

FIG. 17 lists the learning vector quantization neural 
network operating mode determiner design characteristics 
used for feasibility testing of an embodiment of the inven- 
tion. 

FIG. 18 lists the Space Shuttle Main Engine parameters 
used for feasibility testing in an embodiment of the inven- 
tion. 

FIG. 19 lists the Space Shuttle Main Engine flight telem- 
etry data sets used for feasibility testing of an embodiment 
of the invention. 
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FIG. 20 lists the parameter estimation model and sub- 
model configurations used for feasibility testing of an 
embodiment of the invention. 

FIG. 21 lists the Space Shuttle Main Engine operating 
mode partitioning rules used for feasibility testing of an 
embodiment of the invention. 

FIG. 22 lists the feasibility test results for nominal flight 
data using an embodiment of an operating mode partitioned 
MSET estimation model for the Space Shuttle Main Engine 
in an embodiment of the invention. 

FIG. 23 lists the feasibility test results for signal drift 
failure detection simulations using an embodiment of an 
operating mode partitioned MSET estimation model for the 
Space Shuttle Main Engine in an embodiment of the inven- 
tion. 

FIG. 24 lists the comparative test results for nominal flight 
data using an unpartitioned MSET estimation model for the 
Space Shuttle Main Engine containing one hundred fifty 
observation vectors. 

FIG. 25 lists the comparative test results for signal drift 
failure detection using a unpartitioned MSET estimation 
model for the Space Shuttle Main Engine containing one 
hundred fifty observation vectors. 

FIG. 26 lists the comparative test results for nominal flight 
data using a unpartitioned MSET estimation model for the 
Space Shuttle Main Engine containing three hundred obser- 
vation vectors. 

FIG. 27 lists the comparative test results for signal drift 
failure detection using a unpartitioned MSET estimation 
model for the Space Shuttle Main Engine containing three 
hundred observation vectors. 

FIG. 28 illustrates a mathematical parameter estimation 
model of a type used for Space Shuttle Main Engine 
telemetry data surveillance in an another embodiment. 

FIG. 29 is a schematic functional flow diagram of a 
surveillance system having two X-33 Single Stage to Orbit 
Demonstrator vehicle hydrogen sensors. 

FIG. 30 is a functional schematic of a fault classification 
submodel for a system having two X-33 Single Stage to 
Orbit Demonstrator vehicle hydrogen sensors. 

FIG. 31 illustrates faults and their probable causes for 
hydrogen sensor 1 using a Bayesian Belief Network in an 
embodiment of the invention. 

FIG. 32 illustrates faults and their probable causes for 
hydrogen sensor 2 using a Bayesian Belief Network in an 
embodiment of the invention. 

FIG. 33 lists the feasibility test results for a partitioned 
model under operating conditions in an embodiment of the 
invention. 

FIG. 34 lists the feasibility test results for a partitioned 
model under venting conditions in an embodiment of the 
invention. 

FIG. 35 lists the comparative test results for an unparti- 
tioned model under operating conditions. 

FIG. 36 lists the comparative test results for an unparti- 
tioned model under venting conditions. 

DETAILED DESCRIPTION OF THE 
INVENTION 

Considering the drawings, wherein like reference numer- 
als denote like parts throughout the various drawing figures, 
reference numeral 10 is directed to the system according to 
the instant invention. 

In its essence, and referring to FIG. 1, the system 10 is 
generally comprised of a method and apparatus for perform- 
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ing high sensitivity surveillance of a wide variety of assets 
including industrial, utility, business, medical, 
transportation, financial, and biological processes and appa- 
ratuses wherein such process and/or apparatus asset prefer- 
5 ably has at least two distinct modes or domains of operation 
(e.g., transient and steady state modes or domains). The 
system includes a training procedure 20 wherein a decision 
model 50 of an asset 12 (e.g., a process and/or apparatus) is 
derived from historical operating data using at least one of 
a plurality of computer-assisted techniques. Historical oper- 
ating data includes a set of observations from normal 
operation of the asset 12 that is acquired and digitized by a 
data acquisition means 40 using any combination of elec- 
tronic data acquisition hardware and signal processing soft- 
5 ware known to those having ordinary skill in the art, and 
informed by the present disclosure. Additionally, and as 
delineated infra, one hallmark of the instant invention is an 
operating mode partitioning method of a decision model 50 
for the asset 12 that is performed during the training pro- 
2Q cedure 20. 

The system 10 further includes a surveillance procedure 
60 wherein the operating mode partitioned decision model 
50 is used for high sensitivity computer-assisted surveillance 
of the asset 12 for the purpose of determining whether a 
25 process fault or failure necessitates an alarm or control 
action. Another hallmark of the instant invention, as delin- 
eated hereinbelow, is the use of the operating mode parti- 
tioned decision model 50 as an element of the surveillance 
procedure 60. The system 10 described herein is useful for 
3 q ultra-sensitive detection of the onset of sensor or data signal 
degradation, component performance degradation, and pro- 
cess operating anomalies. 

Description of the Training Procedure: 

More specifically, and referring to FIG. 2, the training 
35 procedure 20 of the system 10 includes a method and 
apparatus for training or preparing the decision model 50 
using historical operating data from the asset 12 that has 
been acquired by the data acquisition means 40 using any 
combination of conventional electronic data acquisition 
40 hardware and signal processing software as is well known in 
the art. Upon acquiring the data, the model designer pro- 
ceeds to implement the unique method for the training 
procedure 20 in accordance with instant invention. The 
historical operating data is acquired in digital format and 
45 stored using a data storage procedure 22. The unique method 
for the training procedure 20 uses an operating mode deter- 
mination procedure 26 to partition the historical operating 
data into one or more training data subsets 28 that together 
comprise a training data set 24 wherein each training data 
50 subset 28 is representative of a single operating mode i (M f ) 
wherein M, is any mode between Mode 1 (M 1 ) to Mode N 
(M^) where N is a positive integer or each training data 
subset 28 is representative of a subset of operating modes of 
the asset 12. The training data set 24 includes at least K 
55 discrete observations of the asset 12 wherein each single 
observation, herein denoted X obs , is comprised of a vector of 
data values for at least each signal parameter to be included 
in the decision model 50. For the purposes of the training 
procedure 20, the number of observations, K, acquired is at 
60 least great enough to adequately bound the operating state 
space of the asset 12. Thus, the training data set 24 provides 
a representative sample of the signals produced by the asset 
12 during all normal modes of operation. 

Again referring to FIG. 2, the unique method for the 
65 training procedure 20 also includes at least one of a param- 
eter estimation submodel creation procedure 29, a fault 
detection submodel creation procedure 30, or a fault clas- 
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sification submodel creation procedure 31 for creating at 
least one decision submodel for inclusion in the decision 
model 50 using at least one training data subset 28. In 
practice, the designer first selects the operating modes that 
will be included in the decision model 50 by means of an 5 
operating mode enable procedure 32. The method thereafter 
is comprised of a training loop wherein each possible 
operating mode of the asset 12 is assessed for inclusion in 
the decision model 50. 

The training loop is in general controlled by two decision 10 
procedures. The mode enabled decision procedure 34 deter- 
mines whether the designer intends a specific operating 
mode to be included in the decision model 50. If the 
operating mode is not to be included, no further processing 
is required and the training loop proceeds to the next 15 
possible operating mode as controlled by the more modes 
decision procedure 36. If the operating mode is to be 
included, the training data subset 28 associated with the 
currently selected operating mode is selected from the 
training data set 24. Depending on the preference of the 20 
designer implementing the training loop, the operating mode 
determination and training data subset extraction procedures 
may be, in general, performed as needed or in advance of the 
submodel creation loop. The submodel creation loop shown 
in FIG. 2 illustrates operating mode determination and 25 
training data subset extraction in advance of implementing 
the submodel creation loop but is not intended to constrain 
the method to preclude determination and extraction on an 
as needed basis. The result of each submodel creation loop 
is the addition of one or more submodels to the decision 30 
model 50. 

Still referring to FIG. 2, the operating mode determination 
procedure 26 used to classify each observation included in 
the training data set 24 may be, in general, performed using 
any method suitable for determining the operating mode of 35 
the asset 12 given an observation or series of observations 
therefrom. Methods suitable for the operating mode deter- 
mination procedure 26 include, but are not limited to, a 
plurality of mathematical or logic sequence techniques, a 
plurality of expert system techniques, a plurality of fuzzy 40 
logic techniques, a plurality of determined similarity 
techniques, a plurality of clustering techniques, and a plu- 
rality of neural network techniques. 

Continuing to refer to FIG. 2, the parameter estimation 
submodel creation procedure 29 may be, in general, per- 45 
formed using any method suitable for defining a parameter 
estimation model useful for estimating the values of one or 
more observed signals. Methods suitable for the parameter 
estimation submodel creation procedure 29 include, but are 
not limited to, a plurality of multivariate state estimation 50 
techniques, a plurality of neural network techniques, a 
plurality of mathematical model techniques, a plurality of 
autoregressive moving average techniques, a plurality of 
principal component analysis techniques, a plurality of 
independent component analysis techniques, a plurality of 55 
determined similarity techniques, and a plurality of Kalman 
filter techniques. Each parameter estimation submodel con- 
tained in the decision model 50 may be created to implement 
any of a plurality of parameter estimation techniques. 
Further, the parameter estimation technique implemented 60 
for an individual submodel is not constrained to be the same 
as the parameter estimation technique implemented for any 
other submodel contained in the decision model 50. 

Continuing to refer to FIG. 2, the fault detection submodel 
creation procedure 30 may be, in general, performed using 65 
any method suitable for defining a fault detection model 
useful for detecting fault indications on the basis of the 
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values of one or more observed signals. Methods suitable for 
the fault detection submodel creation procedure 30 include, 
but are not limited to, a plurality of sequential probability 
ratio test techniques, a plurality of hypothesis test 
techniques, a plurality of neural network techniques, a 
plurality of mathematical model techniques, a plurality of 
comparison threshold techniques, a plurality of limit com- 
parison techniques, a plurality of determined similarity 
techniques, and a plurality of trend analysis techniques. 
Each fault detection submodel contained in the decision 
model 50 may be created to implement any of a plurality of 
fault detection techniques. Further, the fault detection tech- 
nique implemented for an individual submodel is not con- 
strained to be the same as the fault detection technique 
implemented for any other submodel contained in the deci- 
sion model 50. 

Continuing to refer to FIG. 2, the fault classification 
submodel creation procedure 31 may be, in general, per- 
formed using any method suitable for defining a fault 
classification model useful for determining the presence, 
source or cause of an unacceptable asset status or condition 
on the basis of one or more fault indications. Methods 
suitable for the fault classification submodel creation pro- 
cedure 31 include, but are not limited to, a plurality of 
Bayesian belief network techniques, a plurality of neural 
network techniques, a plurality of decision tree techniques, 
a plurality of expert system techniques, a plurality of rule- 
based techniques, a plurality of determined similarity 
techniques, a plurality of hypothesis test techniques, and a 
plurality of procedural logic techniques. Each fault classi- 
fication submodel contained in the decision model 50 may 
be created to implement any of a plurality of fault classifi- 
cation techniques. Further, the fault classification technique 
implemented for an individual submodel is not constrained 
to be the same as the fault classification technique imple- 
mented for any other submodel contained in the decision 
model 50. 

The unique method for the training procedure 20 is 
completed at training complete point 37 when all expected 
operating modes of the asset 12 have been assessed. At this 
point, the decision model 50 includes parameter estimation, 
fault detection, and/or fault classification submodels for 
each operating mode enabled by the designer. The decision 
model 50 is thereafter useful for performing surveillance of 
the asset 12. 

Description of the Surveillance Procedure: 

More specifically, and referring to FIG. 3, the surveillance 
procedure 60 is comprised of acquiring successive vectors of 
operating data and determining for each such observation 
vector whether the operating data is indicative of an unac- 
ceptable status or condition of the asset 12. The surveillance 
procedure 60 further includes implementing an alarm or 
control action for the purpose of notifying an operator or 
taking a corrective action in response to a detected unac- 
ceptable status or condition of the asset 12. The surveillance 
procedure 60 is in general an open-ended data acquisition 
and analysis loop that continues until such time as the 
operator chooses to terminate the surveillance. 

Again referring to FIG. 3, the surveillance procedure 
begins with an observation acquisition procedure 62 for 
acquiring a vector of observed signal data values, herein 
denoted X obs . Signal data values are acquired by the data 
acquisition means 40 using any combination of conventional 
electronic data acquisition hardware and signal processing 
software as noted supra. Next the operating mode determi- 
nation procedure 26 is used to determine the operating mode 
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for the vector of observed signal data values, X obs . It is 
essential only that the operating mode determination proce- 
dure 26 used during the surveillance procedure 60 is the 
same operating mode determination procedure 26 used 
during the training procedure 20. Upon determination of the 
operating mode associated with the observed signal data, the 
decision submodels for the current operating mode are 
selected from the collection of submodels contained in the 
decision model 50 using a decision submodel selection 
procedure 64. The selected decision submodels for the 
current operating mode may then be used with a parameter 
estimation procedure 66 to produce a current vector of 
estimated signal data values, herein denoted X est . It is 
essential only that the parameter estimation procedure 66 
used during the surveillance procedure 60 is the same 
parameter estimation procedure 66 for which the decision 
submodel was trained using the parameter estimation sub- 
model creation procedure 29 during the training procedure 
20. The current vector of estimated signal data values, X est , 
in general includes at least one estimated signal data value 
corresponding to at least one actual signal data value 
included in the current vector of observed signal data values, 
X obs . A series of estimated signal data values produced by 
successive observation and parameter estimation cycles is 
termed herein a “virtual signal” for the signal parameter. 

Still referring to FIG. 3, the current vector of estimated 
signal data values, X est , may be in general compared to the 
current vector of observed signal data values, X obs , using a 
fault detection procedure 68. The fault detection procedure 
68 serves the useful purpose of determining whether the 
current vector of observed signal data values indicates an 
unacceptable status or condition of the asset 12. The fault 
detection procedure 68 may be performed using any one of 
a plurality of comparative techniques. 

The results of the fault detection procedure 68 might 
detect faults based on the current vector of observed signal 
data values. In many cases, fault detection quality is 
improved by using a fault indication decision procedure 70 
that incorporates logic for considering one or more fault 
detection results in making the fault indication decision. The 
fault indication decision procedure 70 may be in general 
performed using any method suitable for ascertaining a fault 
indication given a fault detection result or series of fault 
detection results. Methods suitable for the fault indication 
decision procedure 70 include, but are not limited to, single 
observation techniques (e.g., alarm on every detected fault), 
multi-observation voting techniques (e.g., alarm when X out 
of Y observations contain a fault indication), and conditional 
probability techniques (e.g., compute the fault probability 
given a series of fault detection results). 

When faults are indicated by the fault indication decision 
procedure 70, the unique method for the surveillance pro- 
cedure 60 provides for a fault classification procedure 76. 
The fault classification procedure is useful for determining 
the presence, source or cause of an unacceptable asset status 
or condition on the basis of one or more fault indications. 
The classified fault is then provided to the alarm or control 
action procedure 74 for the useful purpose of enabling an 
automated or operator directed corrective action or warning. 

Upon completing the fault indication decision procedure 
70 or the alarm or control action procedure 74, the surveil- 
lance procedure then repeats for as long as a more data 
decision procedure 72 determines that additional surveil- 
lance data are available or terminates at surveillance com- 
plete step 75 when no more surveillance data are available. 

Continuing to refer to FIG. 3, the usefulness of the instant 
invention is, inter alia, the improvement achieved in the 
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accuracy of the fault decision made by the fault classifica- 
tion procedure 76. Improving the accuracy of the fault 
classification procedure 76 accomplishes a reduction in the 
number of false alarms sent to a process operator or control 
5 system that can in turn result in an erroneous alarm or 
control action by the alarm or control action procedure 74. 
Further, improving the accuracy of the fault classification 
procedure 76 accomplishes a reduction in the number of 
missed alarms thereby accomplishing more timely alarm or 
control action by the alarm or control action procedure 74. 
The instant invention thereby enables improved operating 
safety, improved efficiency and performance, and reduced 
maintenance costs for a wide variety of industrial, utility, 
business, medical, transportation, financial, and biological 
processes and apparatuses wherein such process and/or 
15 apparatus asset 12 preferably has at least two distinct modes 
or domains of operation. 

FIG. 4 shows the training data set 24 partitioned into a 
plurality of training data subsets 28 wherein the operating 
mode associated with each training data subset 28 is deter- 
20 mined using the operating mode determination procedure 
26. 

FIG. 5 shows an example of the decision model 50 that is 
comprised of the parameter estimation model 52, the fault 
detection model 54, and the fault classification model 56. 
25 FIG. 5 further shows that the parameter estimation model 52 
is comprised of at least one parameter estimation submodel 
53, that the fault detection model 54 is comprised of at least 
one fault detection submodel 55, and that the fault classifi- 
cation model 56 is comprised of at least one fault detection 
30 submodel 57. 

FIG. 6 shows an example of the decision model 50 that is 
comprised of the fault detection model 54 and the fault 
classification model 56. FIG. 6 is intended to illustrate that 
any of the parameter estimation model 52, the fault detection 
model 54, and/or the fault classification model 56 might not 
be used in some cases for accomplishing the asset surveil- 
lance function. 

FIG. 7 shows an example of the fault classification model 

56. FIG. 7 is intended to illustrate that the each of the fault 

40 

detection submodels 57 might be uniquely configured to 
more accurately accomplish the asset surveillance function 
for a particular operating mode. For example, in FIG. 7 each 
of the fault classification submodels shown are uniquely 
45 configured. 

Referring to FIG. 7, Submodel 1 will typically operate as 
follows. If only Indication 1 is abnormal the condition will 
be classified as Fault 1. If only Indication 2 is abnormal the 
condition will be classified as Fault 2. If both Indication 1 
50 and Indication 2 are abnormal the condition will be classi- 
fied as Fault 3. 

In contrast and still referring to FIG. 7, Submodel i will 
typically operate as follows. If only Indication 1 is abnormal 
the condition will be classified as Fault 1. If only Indication 
55 2 is abnormal the condition will be classified as Fault 2. If 
both Indication 1 and Indication 2 are abnormal the condi- 
tion will be classified as both Fault 1 and Fault 2. 

In additional contrast and still referring to FIG. 7, Sub- 
model N will typically operate as follows. If only Indication 
60 1 is abnormal the condition will be classified as Fault 1. If 
only Indication 2 is abnormal the condition will be classified 
as either Fault 3 or as no fault depending on the importance 
of Indication 1 for confirming the presence of Fault 3. If both 
Indication 1 and Indication 2 are abnormal the condition will 
65 be classified as Fault 3. 

Thus, still referring to FIG. 7, the behavior of the fault 
classification procedure 76 using the fault classification 
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model 56 can be tailored to suit the decision model 50 
designer’s requirements for the asset surveillance function. 

FIG. 8 shows the steps used to determine the status or 
condition of the asset 12 using the classification of faults 
during asset surveillance. The first step is to acquire 5 
observed signal values from the asset 12 using the obser- 
vation acquisition procedure 62. The second step is to 
determine corresponding estimated signal values using the 
parameter estimation procedure 66. The third step is to 
determine the presence of any fault indications using the 10 
fault detection procedure 68. The fourth step is to classify 
the presence, source and/or cause of the fault indications, if 
any, using the fault classification procedure 76. These steps 
repeat until terminated by the more data decision procedure 

72 ' . 15 

FIG. 9 outlines a general surveillance procedure of the 

system 10 when employing the operating mode partitioned 
decision model 50. In a typical surveillance procedure, the 
asset 12 is the source of at least one signal source 42 that is 
acquired and digitized using conventional data acquisition 2 o 
means 40 for providing the data acquisition procedure for 
the purpose of computer-assisted surveillance. The digitized 
signal data is generally evaluated using a computer 44 
having computer software modules implementing the vari- 
ous procedures describe supra, such as the operating mode 2 s 
determination procedure 26, and further providing the 
memory means for the decision model 50. The operating 
mode determination procedure 26 is used to determine the 
current operating mode of the asset 12 given the acquired 
process signal data. The decision model 50 provides the 30 
operating mode partitioned parameter estimation model 52 
that in turn provides the parameter estimation submodel 53 
that is used to produce an estimated signal value for at least 
one signal source 42 emanating from the asset 12. The 
parameter estimation submodel 53 in general uses the 35 
parameter estimator procedure 66 to produce the estimated 
signal values. The parameter estimation submodel 53 
selected from the decision model 50 and used by the 
parameter estimation procedure 66 is dependent on the 
operating mode determined by the operating mode determi- 40 
nation procedure 26. 

The observed signal values and/or the estimated signal 
values are then generally evaluated to identify the presence 
of any unacceptable status or condition of the asset 12. The 
decision model 50 provides the operating mode partitioned 45 
fault detection model 54 that in turn provides the fault 
detection submodel 55 that is used to detect the indications 
of any unacceptable status or condition of the asset 12. The 
fault detection submodel 55 in general uses the fault detec- 
tion procedure 68 to detect the indications of faults. The fault 50 
detection submodel 55 selected from the decision model 50 
and used by the fault detection procedure 68 is dependent on 
the operating mode determined by the operating mode 
determination procedure 26. 

The fault indications, if any, are then generally evaluated 55 
to classify the presence, source and/or cause of any unac- 
ceptable status or condition of the asset 12. The decision 
model 50 provides the operating mode partitioned fault 
classification model 56 that in turn provides the fault clas- 
sification submodel 57 that is used to classify any unaccept- 60 
able status or condition of the asset 12. The fault classifi- 
cation submodel 57 in general uses the fault classification 
procedure 76 to classify the indications of faults. The 
specific fault classification submodel 57 selected from the 
decision model 50 and used by the fault classification 65 
procedure 76 is dependent on the operating mode deter- 
mined by the operating mode determination procedure 26. 
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The results of the fault classification are thereafter com- 
municated by a conventional communications link 80 (as is 
known to those having ordinary skill in the art, and informed 
by the present disclosure) to an operator console 82 or 
automated process control system 84 for possible alarm 
and/or control action. 

The computer 44 along with its typically associated 
memory means can also be employed to perform the training 
and surveillance procedures 20, 60 as delineated supra and 
to store all the data associated with these procedures, for 
example, the historical operating data, the training data and 
decision model. 

MSET Procedure: 

In an embodiment of the invention, the method used for 
parameter estimation is a multivariate state estimation tech- 
nique (MSET) procedure. The US Department of Energy’s 
Argonne National Laboratory developed the implementation 
of MSET described herein for surveillance of sensors and 
components in nuclear power plant applications. However, 
other implementations of a multivariable state modeling 
technique, for example multivariable linear regression, are 
possible and useful in conjunction with the instant invention. 
MSET is in general a statistically derived parameter esti- 
mation algorithm that uses advanced pattern recognition 
techniques to measure the similarity or overlap between 
signals within a defined operational domain wherein the 
domain is defined by a set of operating examples. MSET 
“learns” patterns among the signals by numerical analysis of 
historical process operating data. These learned patterns or 
relationships among the signals are then used to estimate the 
expected signal values that most closely corresponds with a 
new signal data observation. By quantifying the relationship 
between the current and learned states, MSET estimates the 
current expected response of the process signals. MSET 
parameter estimates are then used with a form of statistical 
hypothesis testing, such as the sequential probability ratio 
test (SPRT) or similar probability ratio test algorithm (as 
shown in U.S. Pat. No. 5,459,675 and which is hereby 
incorporated by reference in its entirety) to compare the 
current estimated value of a signal with its observed value. 
The statistical hypothesis comparison test provides a sensi- 
tive and widely applicable method to detect a fault or failure 
in an asset. However, other implementations of the com- 
parison test are possible and useful in conjunction with the 
instant invention. 

An MSET parameter estimation model is created for the 
asset 12 using the MSET training algorithms to learn the 
inherent data relationships within a set of historical process 
operating data. A SPRT fault detection model is calibrated 
using the MSET parameter estimation model and the set of 
historical process operating data. The trained MSET model 
is then used with the MSET parameter estimation procedure 
and the SPRT fault detection procedure to perform the 
process surveillance function when presented with a new 
observation of signal data values. The following sections 
will first provide a mathematical overview of the MSET 
algorithms and procedures useful for training a parameter 
estimation model and for using this trained model for 
process surveillance. The description is followed by a 
detailed description of a preferred embodiment of the instant 
invention using a novel operating mode partitioned param- 
eter estimation model for asset surveillance. 

Description of the MSET Training and Surveillance Pro- 
cedures: 

The MSET methods are generally described in the fol- 
lowing two US Government documents produced and main- 
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tained by the US Department of Energy’s Argonne National 
Laboratory, Argonne, 111., disclosure of which is incorpo- 
rated in its entirety herein by reference. The MSET methods 
were embodied in MSET software provided by Argonne 
National Laboratory under NASA Contracts NAS4-99012 5 
and NAS13-01001 and were not modified or in themselves 
improved for the purposes of a-preferred embodiment. 

J. P. Herzog, S. W. Wegerich, R. M. Singer, and K. C. 
Gross, “Theoretical Basis of the Multivariate State Estima- 
tion Technique (MSET),” Argonne National Laboratory, 1° 
ANL-NT-49, December 1997. 

J. P. Herzog, S. W. Wegerich, K. C. Gross, and R. M. 
Singer, “MSET: Code Structure and Interface Development 
Guide,” ANL-NT-48, August 1997. 

The MSET algorithm uses pattern recognition with his- 15 
torical operating data from an asset to generate one form of 
a parameter estimation model. If data are collected from a 
process over a range of operating points, these data can be 
arranged in matrix form, where each column vector (a total 
of m) in the matrix represents the measurements made at a 
particular point. Thus, this matrix will have the number of 
columns equal to the number of operating points at which 
observations were made and the number of rows equal to the 
number of measurements (a total of n signal data values) that 
were available at each observation. We begin by defining the 
set of measurements taken at a given time tj as an observa- 
tion vector X(ty), 


Mtj)=[xi(tj),x 2 (tj), ■ ■ ■ , x„(tj)Y (i) 

where x/ty) is the measurement from signal i at time ty. We 
then define the data collection matrix as the process 
memory matrix D: 
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Each of the column vectors (X(ty)) in the process memory 
matrix represents an operating point of the process. Any 
number of observation vectors can be assigned to the 
process memory matrix. Training an MSET model includes 
collecting enough unique observation vectors from histori- 45 
cal operation of the process during normal conditions such 
that the process memory matrix encompasses the full 
dynamic operating range of the process. Computation of the 
D matrix is the first of three steps in the method for training 
an MSET model based on historical operating data. 50 

One of at least two algorithms is used by MSET to select 
the vectors in the D matrix. The MinMax algorithm extracts 
vectors that bound the vector space defined by the training 
data and returns the smallest process memory matrix that 
will produce an effective system model (see also U.S. Pat. 55 
No. 5,764,509 and which is hereby incorporated by refer- 
ence in its entirety). The Vector Ordering algorithm selects 
and includes representative vectors from the inner regions of 
the vector space producing a more accurate system model. 

Once the process memory matrix has been constructed, 60 
MSET is used to model the dynamic behavior of the system. 

For each current observation of the system (X ofey ), MSET 
compares the observation vector to the stored operating 
points to calculate an estimate of the process parameter 
values. The parameter estimate of the current process state 65 
(X esf ) is an n-element vector that is given by the product of 
the process memory matrix and a weight vector, W: 


X es = DW (3) 

The weight vector represents a measure of similarity 

between the estimate of the current operating point and the 
elements of the process memory matrix. To obtain the 
weight vector, we minimize the error vector, R, where: 

R = x 0 te -X eS f (4) 

The error is minimized for a given operating point when: 

W=(D r © D) _1 (D r © X obs ) (5) 

This equation represents a “least squares” minimization 
when the pattern recognition operator (x) is the matrix dot 
product. The Argonne MSET software includes a choice of 
several pattern recognition operators that provide excellent 
parameter estimation performance (for example, see U.S. 
Pat. No. 5,764,509 and U.S. Pat. No. 5,987,399 each hereby 
incorporated by reference in their entirety). 

Once the weight vector is found, the resulting current 
estimate of the signal data values (i.e., the parameter esti- 
mate vector) is given by: 

X„,-D'(D r © D^'CD r © X obs ) (6) 

The first application of the pattern recognition operator in 
equation (6) (D r (x) D) involves a comparison between the 
row vectors in the D r matrix and each of the column vectors 
in the D matrix. If we define G=D r (x) D, then G, the 
similarity matrix, is an m by m matrix. The element in the 
i-th row and j-th column of the matrix (g^) represents a 
measure of the similarity between the i-th and j-th column 
vectors (i.e., memorized operating points) in the process 
memory matrix. The second application of the pattern rec- 
ognition operator in equation (6) (D r (x)X ofc5 ) involves a 
comparison between the row vectors in the D r matrix and 
each of the elements in the observation vector X^. If we 
define A=D r (x) X obs , then A, the similarity vector, is an m 
by 1 vector. Each element in the similarity vector is a 
measure of the similarity between the observation vector and 
the i-th column vector (i.e., memorized operating points) in 
the process memory matrix. 

Note that the similarity matrix is a function of the process 
memory matrix only. Thus, the similarity matrix and its 
inverse Ginv=(D r (x)D) _1 can be calculated as soon as the 
process memory matrix has been derived thereby making the 
application of MSET to an on-line surveillance system more 
computationally efficient. Computation of the Ginv matrix 
initializes the parameter estimation model and completes the 
second of three steps in the procedure for training an MSET 
model based on historical operating data. 

The third and final step in the training procedure includes 
analyzing the historical training data using equation (6) to 
characterize the expected statistical mean and variance of 
the residual error vector, R, for each signal parameter in the 
observation vector. The resulting mean vector, M, is later 
used in the surveillance procedure to normalize the residual 
error for each observation evaluated using the statistical 
hypothesis test fault detection procedure. The resulting 
variance vector, V, is later used at the beginning of the 
surveillance procedure to initialize the fault detection thresh- 
old values used in the statistical hypothesis test fault detec- 
tion procedure. 

FIG. 10 illustrates the procedure for training an MSET 
parameter estimation model. The procedure is used to pro- 
duce an unpartitioned MSET model 102 that is not parti- 
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tioned by operating mode. The MSET training procedure 
developed by Argonne National Laboratory (ANL) as 
described herein is embodied in one instance within the ANL 
software modules known as train. c and sys_mod.c. As 
described herein above, the MSET training procedure begins 
with a MSET model extraction procedure 90 used to popu- 
late a process memory matrix 92 (D) from the training data 
set 24 (historical process operating data). The MSET model 
extraction procedure 90 makes use of at least one of a 
plurality of observation vector extraction methods embodied 
in one instance within the ANL train. c software module, 
including but not limited to the MinMax method, and the 
Vector Ordering method. A MSET model initiation proce- 
dure 94 is the second step of the method and is used to 
initialize the MSET decision model by the computation of a 
inverse similarity matrix 96 (Ginv). The MSET model 
initiation procedure 94 makes use of at least one of a 
plurality of pattern recognition operator methods embodied 

in one instance within the ANL sys mod.c software 

module, including but not limited to the SSA method, the 
BART method, the VPR method, the VSET method, and the 
PSEM method. The third step of the MSET training proce- 
dure uses the process memory matrix 92 and the inverse 
similarity matrix 96 to perform a MSET training data 
analysis procedure 98 using the training data set 24. The 
training data analysis procedure 98 uses the MSET model to 
computes the residual error mean and variance vectors 100 
(M and V, respectively) over the training data. The training 
procedure is in general performed once for the training data 
set 24 thus preparing an unpartitioned MSET model 102 for 
use in the MSET surveillance procedure. 

In the MSET surveillance procedure, new operating data 
observations are evaluated sequentially using the unparti- 
tioned MSET model 102 for the purposes of validating the 
data or discerning an anomalous (not normal) process oper- 
ating condition. For each new observation vector, X obs , 
presented to the MSET parameter estimation method, the 
expected operating state having the greatest similarity to the 
current observed state is returned as a parameter estimate 
vector, X^ f . Diagnostic decisions are then made on the basis 
of the difference (residual error) between the observed and 
estimated values for at least one process signal parameter 
contained in the estimate vector. Fault indications are deter- 
mined using at least one of a plurality of fault detection 
methods including, but not limited to, a threshold limit test 
method, a Sequential Probability Ratio Test (SPRT) method, 
and a Bayesian Sequential Probability (BSP) test method to 
produce a fault indication based on the value of the residual 
error for at least one process parameter. 

FIG. 11 illustrates the method and system for MSET- 
based surveillance. The MSET surveillance methods as 
described herein are embodied in one instance within the 
ANL software modules known as sys_mod.c and fault_ 
detect. c. Prior to performing surveillance for new operating 
data observations, a MSET fault detector initialization pro- 
cedure 106 is performed. The MSET fault detector initial- 
ization procedure 106 takes the variance (V) vector 100 and 
several other constants as its arguments. The initialization 
procedure makes use of one of a plurality of fault detection 
methods embodied in one instance within the ANL fault_ 
detect. c software module, including but not limited to the 
SPRT method, and the BSP method. The MSET surveillance 
procedure then proceeds by sequentially acquiring and 
evaluating each new data observation until such time as 
surveillance is completed. Data observations are acquired 
using the observation acquisition procedure 62. For each 
new observation vector, X obs , a parameter estimate vector, 
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X est , is produced by the parameter estimation procedure 66 
using the unpartitioned MSET model 102 with the same 
pattern recognition operator that was used in the MSET 
training procedure. The residual error vector, R, is computed 
5 and is then normalized using a residual value normalization 
procedure 108 that includes subtracting the mean (M) vector 
100 from the value of the residual error. The normalized 
residual vector is then evaluated using the same fault detec- 
tion procedure 68 that was initialized at the start of the 
10 MSET surveillance procedure. If the fault detection proce- 
dure 68 results in a fault determination by the fault indica- 
tion decision procedure 70, the alarm or control action 
procedure 74 communicates the fault information by the 
15 conventional communications link 80 (not shown) to the 
operator console 82 (not shown) and/or automated process 
control system 84 (not shown) for corrective action. In the 
fault indication decision procedure 70, a Bayesian condi- 
tional probability test is in general used to reach a fault 
20 decision based on a series of fault detection results from the 
fault detection procedure 68. The surveillance procedure 
then repeats for as long as the more data decision procedure 
72 determines that additional surveillance data is available. 

Limitations of the MSET Training and Surveillance 
25 Method and System: 

In the method and system described above, MSET is 
trained by the construction of a process memory matrix, D, 
based on historical operating data containing a collection of 
normal operating points of the process. MSET creates the 
30 process memory matrix by selecting representative process 
data observations (herein termed observation vectors) that 
characterize the dynamic patterns inherent across all oper- 
ating points of the process. However, if the process can 
operate in two or more distinct modes of operation, then the 
35 totality of operating points for all possible operating modes 
must be represented in the process memory matrix to 
produce an effective MSET model. As the number of distinct 
operating modes of process operation represented in the 
training data increases, one of two limitations occur: 

Limitation 1. If the total number of observation vectors in 
the process memory matrix is fixed, then the number of data 
patterns used to represent any single operating mode of a 
process decreases. This directly reduces the accuracy of 
45 MSET’s parameter estimates, which may result in false 
alarms or reduce the ability of the fault detection procedure 
to reliably detect subtle sensor failures or other process 
anomalies. 

The parameter estimation accuracy of the MSET algo- 
50 rithm is in general an inverse power law function of the 
number of vectors in the process memory matrix. Limitation 
1 is evident in the example of FIG. 12 that illustrates the 
overall parameter estimation error versus the number of 
vectors in the process memory matrix for an unpartitioned 
55 MSET model of six Space Shuttle Main Engine sensors. 

Limitation 2. Allowing the number of observation vectors 
in the process memory matrix to increase ameliorates Limi- 
tation 1 above, but incurs a computational performance cost. 
The number of computer operations required for MSET to 
60 produce a parameter estimate scales with the square of the 
number of observation vectors stored in the process memory 
matrix. This is because the MSET parameter estimation 
algorithm must perform pattern matching between the cur- 
rent operating data vector and each element of the process 
65 memory matrix. Pattern matching uses the Ginv matrix, the 
size of which increases as the square of the number of 
observation vectors. Processing time for MSET parameter 
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estimation has been empirically shown to follow a square 
law equation of the form: 

Observation processing time (msec)=A+Z?*[Number of observation 
vectors in D] 2 (7) 

Limitation 2 is evident in the example of FIG. 13 that 
illustrates the overall MSET parameter estimation process- 
ing time on a 300-MHz Pentium II desktop computer versus 
the number of vectors in the D matrix for an unpartitioned 
MSET model of six Space Shuttle Main Engine sensors. 

Novel Improvements to the MSET Training and Surveil- 
lance Procedures: 

Having described the MSET training and surveillance 
methods herein above, this section describes the novel 
improvements made by the instant invention when used for 
MSET training and surveillance, the improvements being 
applicable to any asset preferably having at least two distinct 
modes of operation. It is explained herein above that it is 
beneficial to minimize the number of vectors in the process 
memory matrix in order to optimize the processing speed of 
the MSET algorithm. It is further explained herein above 
that the MSET methods require a trade-off to be made 
between processing time and parameter estimation accuracy. 
In the worst case, this trade-off results in unacceptable 
performance for a process surveillance application. The 
novel solution to this problem made by the instant invention 
is to use multiple coordinated MSET parameter estimation 
submodels, with each submodel trained over a limited 
operating mode state space. With the instant invention, each 
submodel may be defined to contain only the minimum 
number of observation vectors required to adequately char- 
acterize a single specific operating mode or related subset of 
modes. Since only one submodel must be evaluated for each 
data observation presented to MSET during the surveillance 
procedure, both parameter estimation accuracy and process- 
ing speed are greatly improved. 

The following example illustrates an unobvious benefit of 
the instant invention. Consider a process that requires 
on-line surveillance across multiple modes of operation. 
Further consider that the safety or other critical nature of the 
surveillance requires fault decision performance within a 
time interval that allows for on-line MSET processing with 
a process memory matrix containing at most 100 vectors. 
However, further suppose that the desired fault detection 
accuracy requires on-line MSET parameter estimation with 
a process memory matrix containing 300 vectors to 
adequately characterize the operating mode state space. In 
the prior art, both criteria could not be simultaneously 
satisfied. The instant invention solves this problem for many 
types of processes and apparatuses by enabling the decision 
model designer to partition the operating mode state space 
and thus produce three 100 vector submodels providing the 
desired level of fault detection sensitivity (300 vectors) 
while having a processing speed comparable to the 100 
vector model. This implementation requires only the addi- 
tion of an operating mode determination procedure that 
selects the appropriate submodel for each new observation 
presented to the system and does not require a change to the 
MSET method itself. 

Improved Training Procedure: 

The combination of an MSET parameter estimation 
model 52 with an associated SPRT fault detection model 54 
is herein termed an MSET decision model 50. An MSET 
decision model is one of a plurality of possible implemen- 
tation specific instances of the decision model 50. 

FIG. 14 illustrates the training procedure 20 useful for 
producing one form of decision model 50 or specifically a 
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MSET decision model 50 in accordance with the instant 
invention. The training procedure includes and modifies the 
MSET training methods described in FIG. 10 and illustrated 
in FIG. 14 as MSET training procedure 118. With the instant 
5 invention, the model designer may now individually specify 
those operating modes for which MSET and SPRT training 
and surveillance is enabled. The training procedure loops 
through each defined operating mode with the loop con- 
trolled by the mode enabled decision procedure 34 and the 
10 more modes decision procedure 36. If the operating mode is 
enabled, a MSET decision submodel 114 is created (this is 
a specific example of parameter estimation submodel cre- 
ation procedure 29 and fault detection submodel creation 
procedure 30 when employing MSET and SPRT) for the 
15 operating mode. In order to create the MSET decision 
submodel 114, the operating mode specific training data 
subset 28 is first extracted from the training data set 24 using 
the operating mode determination procedure 26. This oper- 
ating mode specific training data subset 28 is then used to 
20 create the MSET decision submodel 114 using the same 
procedures used in the MSET training procedure 118 to 
create an unpartitioned MSET model. Specifically, the 
MSET procedures used in sequence are the MSET model 
extraction procedure 90 to produce the process memory 
25 matrix 92, the MSET model initialization procedure 94 to 
produce the inverse similarity matrix 96, and the MSET 
training data analysis procedure 98 to produce the residual 
mean and variance vectors 100 used by the SPRT fault 
detection procedure. Note that this series of procedures is 
30 grouped in the general case as the parameter estimation 
submodel creation procedure 29 and fault detection sub- 
model creation procedure 30. The process is repeated with 
each loop including a MSET decision submodel storage 
procedure 116 to add the MSET decision submodel 114 to 
35 the MSET decision model 50 for each enabled operating 
mode. At the conclusion of the training procedure 20, the 
operating mode partitioned MSET decision model 50 
includes an array of individual MSET decision submodels 
114, one for each enabled operating mode. 

40 The MSET decision model 50 is a collection of the 
operating mode specific MSET decision submodels. The 
MSET decision model 50 includes the following at a mini- 
mum: 

A set of process memory matrices 92 (D), one for each 
45 enabled operating mode; 

A set of inverse similarity matrices 96 (Ginv), one for 
each enabled operating mode; and 

A set of residual mean and variance vectors 100 (M and 
V), one for each enabled operating mode. 

50 Working together these decision submodels provide 
parameter estimation and fault detection over the entire 
operating mode state space that the designer has selected for 
surveillance. An additional novel feature of the instant 
invention is that each of the decision submodels in the 
55 MSET decision model 50 may be of unique dimensions that 
is, each submodel may contain unique numbers of modeled 
signal parameters and process memory matrix vectors. A 
decision submodel’s dimensions may be different than the 
dimensions selected for any other operating mode thereby 
60 permitting the unobvious benefit of further optimizing the 
MSET method and system for the surveillance requirements 
of each individual operating mode of the asset. This is 
important because certain modes of process operation are 
often more performance or safety critical than others. 

65 An additional novel feature of the instant invention is that 
in general each of the submodels in the decision model 50 
may also be specified with unique parameter estimation and 
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fault detector settings for each operating mode. This pro- 
vides the unobvious benefit of optimizing surveillance sen- 
sitivity and performance by operating mode. Examples of 
optimization by operating mode include, but are not limited 
to, the following: 5 

Selection of the parameter estimation and training algo- 
rithm by operating mode; 

Selection of the fault detection algorithm by operating 
mode; 

Selection of the fault detection procedure false alarm 
probability, missed alarm probability, system disturbance 
magnitude values, or other threshold constants by operating 
mode; 

Selection of the fault classification procedure algorithm 
and associated thresholds and constants by operating mode. 

Novel Improvements to the Parameter Estimation Proce- 15 
dure: 

FIG. 15 illustrates a novel method and system for the 
surveillance procedure 60 using the MSET decision model 
50 as delineated hereinabove. The surveillance procedure 60 
includes and modifies the surveillance methods described in 20 
FIG. 11 and illustrated in FIG. 15 as MSET surveillance 
procedure 122, 126. With the instant invention, the model 
designer may now individually specify those operating 
modes for which MSET surveillance is enabled. 

At the beginning of the surveillance procedure, the fault 25 
detection procedures are initialized for each enabled deci- 
sion submodel. Initialization of each MSET decision sub- 
model 114 uses the same fault detector initialization proce- 
dure 106 used for initialization of the unpartitioned model 
102. The surveillance procedure thereafter includes an open- 30 
ended loop for data acquisition and surveillance processing 
that is terminated by the more data decision procedure 72. 

During surveillance, each new vector of observed signal 
data values, X obs7 is acquired using the data acquisition 
procedure 40 and the observation acquisition procedure 62. 35 
Next, the operating mode determination procedure 26 is 
used to determine the operating mode for each new data 
observation, X obs , acquired from the asset 12. If the new data 
observation is determined by the mode enabled decision 
procedure 34 to represent an operating mode that is not 40 
enabled for surveillance, no further processing is required 
until the next data observation is acquired from the asset 12. 
Conversely, if the new data observation is determined to 
represent an enabled operating mode, the correct MSET 
decision submodel 114 is selected from the MSET decision 45 
model 50 using the decision submodel selection procedure 
64 and all required decision submodel data is loaded into the 
computer memory. From this point, surveillance processing 
occurs using the same procedures previously described for 
the MSET method until a fault indication is encountered. 50 
The method of the instant invention uniquely includes the 
fault classification procedure 76 that was not previously 
used in conjunction with MSET. Once surveillance process- 
ing is completed, the procedure returns to acquire a new data 
observation from the asset 12. 55 

An unobvious benefit of only performing processing for 
selected operating modes is that the MSET decision model 
does not need to provide parameter estimation capabilities 
for those operating modes that do not require on-line sur- 
veillance. For example, it may be desirable to exclude 60 
certain modes of operation (or non-operation) from the 
MSET decision model even though such modes are included 
within the training data. The ability to explicitly exclude 
operating modes that do not require surveillance simplifies 
the training data acquisition procedures and minimizes the 65 
on-line processing time required for a parameter estimation 
based surveillance method. 
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While an MSET procedure is described herein above, any 
type of parameter estimation procedure can be used with the 
instant invention. The novelty described infra is not a 
modification or improvement to the MSET procedure, but is 
rather a new means of using any parameter estimation 
procedure so as to more effectively accomplish the asset 
surveillance objective. 

SPRT Fault Detection Procedure: 

Parameter estimation techniques, such as delineated 
hereinabove, can provide an extremely accurate estimate of 
asset signals. The difference between the current estimates 
and the current signal observations provides a set of residual 
values used as the indicator for sensor and equipment faults. 
Instead of using standard threshold limits to detect fault 
conditions (i.e., declaring a fault indication when a signal’s 
residual value exceeds a preset threshold), we have demon- 
strated excellent fault detection performance using a sequen- 
tial probability ratio test (SPRT) fault detection procedure 68 
in conjunction with the present invention. 

The SPRT algorithm is one instance of a family of 
likelihood ratio tests that improve the threshold detection 
process by providing more definitive information about 
validity using statistical hypothesis testing. The SPRT tech- 
nique is based on user-specified false alarm and missed 
alarm probabilities, allowing control over the likelihood of 
missed detection or false alarms. The SPRT technique 
provides a superior surveillance tool because it is sensitive 
not only to disturbances in the signal mean, but also to very 
subtle changes in the statistical quality (variance, skewness, 
bias) of the signals. For sudden, gross failures of a sensor or 
component, the SPRT will indicate the disturbance as fast as 
a conventional threshold limit check. However, for slow 
degradation, the SPRT can detect the incipience or onset of 
the disturbance long before it would be apparent with 
conventional threshold limit checks. 

The SPRT fault detection procedure monitors successive 
observations of a process by analyzing the stochastic com- 
ponents of the signal’s residuals given by equation (4), 
above. Let R„ represent the residual signal at a given 
moment t n in time. Then the sequence of values {R n }={R 0 , 
R-l, . . . R„} should be normally distributed with mean 0, 
when the asset is operating normally. 

The first test utilized by the SPRT is a test for a change in 
the sequence mean, which declares that the signal is 
degraded if the sequence {R„} exhibits a non-zero mean, 
e.g., a mean of either ±M where M is the user-assigned 
system disturbance magnitude for the mean test. The SPRT 
will decide between two hypotheses: H 1? where the residual 
signal forms a Gaussian probability density function with 
mean M and variance a 2 ; or H 0 , where the residual signal 
forms a Gaussian probability density function with mean 0 
and variance o 2 . If the SPRT accepts H 1? we declare that the 
residual signal is degraded (a fault is detected). 

The SPRT fault detection procedure operates as follows. 
At each time step, a test index is calculated and compared to 
two threshold limits A and B. The test index is related to the 
likelihood ratio (L„), which is the ratio of the probability that 
hypothesis H a is true to the probability that hypothesis H 0 is 
true: 

Probability of sequence! Rn }gi ven Hi true (8) 

^ Probability of sequence! /?„}given Ho true 

If the likelihood ratio is greater than or equal to the upper 
threshold limit (i.e., L n =^B), then it can be concluded that 
hypothesis H 1 is true. If the likelihood ratio is less than or 
equal to the lower threshold limit (i.e., L„= A), then it can be 
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concluded that hypothesis H 0 is true. If the likelihood ratio 
falls between the two limits (i.e., A<L„<B), then neither 
hypothesis can be concluded to be true and sampling con- 
tinues. 

The SPRT technique allows the user to specify the tar- 5 
geted likelihood of missed detection or false alarm. The 
following expressions relate the threshold limits to the 
misidentification probabilities: 


1 - a a 


where a is the probability of accepting F^ when H 0 is true 
(i.e., the false alarm probability) and p is the probability of 
accepting H 0 when F^ is true (i.e., the missed detection 15 
probability). 

Assuming that the residual signal {R n } is normally 
distributed, the likelihood ratio L n is given by: 


Ln = exp 


2 


( 10 ) 20 


Defining the SPRT index for the mean test to be the 
exponent in equation (10), 25 

- 1 M (11) 

SPRT mem = M(M - 2 r k ) = - j) 

k=l £=1 

30 

Then by taking the natural logarithms of equations (9) and 
(10) the sequential sampling and decision strategy can be 
concisely represented as: 

If SPRT meflM =ln(p/(l-a)), then accept hypothesis H 0 as 
true, 35 

If ln(p/(l-a))^SPRT meflM ^ln((l-P)/a), then continue 
sampling, and 

If SPRT mea „iTn((l-p)/cx), then accept hypothesis Hj as 
true. 

Two indices are calculated for each of the sequential 40 
hypothesis tests. For the mean test, an index is calculated for 
both positive and negative values of the system disturbance 
magnitude, M. The system disturbance magnitude for the 
mean test specifies the number of standard deviations the 
residual signal must shift in the positive or negative direc- 45 
tion to trigger an alarm. 

The implementation of SPRT is originally described in 
Wald and Wolfowitz, “Optimum Character of the Sequential 
Probability Ratio Test,” Ann. Math. Stat., 19, 326 (1948), 
disclosure of which is incorporated herein by reference. 50 

Limitations of Prior Fault Detection Procedures 

Threshold limit tests, SPRTs and related techniques for 
fault detection are of great practical use in the field of asset 
surveillance and equipment condition monitoring. Many 
practitioners report their applications and value in a wide 55 
range of deployments. The majority of such deployments 
use threshold limit tests; however, more recently SPRTs and 
related techniques have come into significant use. 

A limitation of all prior fault detection techniques has 
been the inevitable trade off between false alarm rate and 60 
missed detection rate. When set too sensitively, the fault 
detector alarms as desired when actual fault conditions occur 
but might also alarm when normal process variations exceed 
the detector’s alarm limits (false alarms). When set too 
conservatively, the fault detector does not false alarm as 65 
readily but the detection of an actual fault condition is most 
often delayed or missed. 


The trade off between false alarm rate and missed detec- 
tion rate is aggravated when the asset is monitored over a 
range of operating modes. In practice, transient modes and 
certain operating modes will commonly imbue greater levels 
of uncertainty in the observed asset signals due to noise, 
bias, lead/lag, and other effects than will other operating 
modes. The practical result is increased uncertainty in both 
the current observed signal and in the parameter estimation 
model developed from training data signals. This leads to 
larger residuals in some asset operating modes than in 
others. Using a single set of alarm limits for fault detection 
results in higher false alarm rates for noisy operating modes 
and higher missed alarm rates for less noisy operating 
modes. 

Novel Improvements to the Fault Detection Procedure 

Since fault detection tests commonly operate on a math- 
ematical function of the residuals or other evaluation of the 
observed signal, the alarm limits used for the fault detection 
tests are more accurate and consistent when a separate set of 
alarm limits are provided for each operating mode of the 
asset. The method and system of the instant invention 
uniquely provides this capability. 

While a SPRT procedure is described herein above, any 
type of fault detection procedure can be used with the instant 
invention. The novelty described infra is not a modification 
or improvement to the SPRT procedure, but is rather a new 
means of using any existing fault detection procedure so as 
to more effectively accomplish the asset surveillance objec- 
tive. 

BBN Fault Classification Procedure 

Bayesian belief networks (BBNs) are applied in Decision 
Theory under various names, including Causal Probabilistic 
Networks, Causal Nets, and Probabilistic Influence Dia- 
grams. BBN methods provide accurate calculations of 
expected false alarm and missed alarm rates, are resilient to 
missing information (e.g., instrumentation failures), and 
gracefully handle multiple failures. BBNs are based on the 
calculus of probabilities and provide the mathematical for- 
malism whereby engineering judgment may be expressed as 
the degree of belief in an outcome given a set of observa- 
tions. They have been used in a wide variety of applications 
to represent probabilistic knowledge for automated reason- 
ing. Using Bayesian probability theory, it is possible to 
capture engineering knowledge about the dependencies 
between variables and to propagate consistently and quan- 
titatively the impact of evidence for diagnosing the current 
condition of an asset. Specifically, belief networks are 
graphical representations of a joint probability distribution 
in which the graph nodes represent discrete-valued random 
variables and the arcs between nodes represent influences 
between variables. 

The BBN is one example of a fault classification proce- 
dure 76 that might be used in conjunction with the instant 
invention. Other examples include, but are not limited to, 
procedural logic, rule -based expert systems, blackboard 
expert systems, decision trees, determined similarity 
methods, various forms of probabilistic networks and neural 
networks. We use a belief network to capture our knowledge 
about an asset and its operation. A key characteristic of a 
BBN is its ability to deal with the inherent uncertainty in our 
knowledge about a monitored asset. The process of building 
an application-specific belief network is as follows: 

Define the nodes — define the model variables to be 
observed or predicted; 

Define the network topology — define the user’s knowl- 
edge of the conditional dependence and independence 
between nodes; 
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Define the network’s probability tables — quantify statis- 
tical knowledge, uncertainty, and the user’s engineering 
judgments. 

Once this knowledge is expressed, the belief network can 
perform fault classification by calculating the probability 5 
states of the predicted nodes from the tables of conditional 
and marginal probabilities and the states of the observed 
nodes. 

Bayesian belief networks allow us to express our beliefs 10 
with a degree of uncertainty. In most cases, cause and effect 
relationships are not absolute, instead we hold a certain 
degree of belief in these relationships. For example, if we 
say “Birds fly.” we accept that for the most part birds do, in 
fact, fly. However, ostriches don’t fly, neither do penguins, 15 
birds with broken wings, or dead or injured birds. These 
exceptions prevent us from saying with 100% certainty that 
“Birds fly.” It is much more accurate and more natural to say 
that we believe that “Birds fly most of the time.” It is this 
ability to quantify the degree of belief in a proposition, 20 
which makes the BBN useful for a fault classification 
procedure. 

Belief Networks Have Four Necessary Characteristics: 

1. A belief network is built from nodes that represent 25 
variables that can take on multiple values or states. These 
nodes may represent the state of the sensors or fault 
detectors in the model, the cause of any fault, or a decision 

to be made based on the evidence for a fault. 

2. Each of these nodes is connected to at least one other node 30 
by a directed link. The intuitive meaning of these links is 
that the parent has a direct influence on the child. For 
example, there is a greater probability that a bird will fly 

if it is not injured. These relationships are probabilistic as 
opposed to being deterministic and are the reason for a 35 
link between the “Birds fly” node and the “Injured” node. 

A lack of arcs indicates that the two variables are proba- 
bilistically independent. That is, one has no effect on the 
other. If birds fly regardless of the state of the stock 
market, there will be no link between “Birds Fly” and 40 
“Stock Market”. We can also say that two nodes, which 
have one or more common parents but no link between 
them, are conditionally independent. Finally, a node is 
conditionally independent of its indirect predecessors 
given its immediate predecessors. 45 

3. Each node has a conditional probability table that quan- 

tifies the effects of the parents on the children. For 
example, given that a bird is flying, what is the probability 
that it is injured? Nodes with no parents have a prior 
probability table. 50 

4. There may be no directed cycles in the belief network. 

A BBN model enables the probabilistic classification of 

asset fault events on the basis of the probabilities of other 
observable events. The belief network specifically enables 
expert judgment to be formalized so that one can specify a 55 
series of links of the form the Truth of statement A supports 
my belief in statement B’, and can also specify how much 
the truth of A strengthens this belief in B compared, for 
example, to how much some other truth C would weaken 
this belief in B. This information is described mathemati- 60 
cally by giving the probabilities of the states of A conditional 
on those of B, or vice-versa, or via correlation coefficients 
between the states of A and B. 

Bayesian belief networks are mathematically based on the 
science of probability. Bayes’ law is based on the product 65 
rule which states that the probability of A and B is equal to 
the probability of A given B times the probability of B: 


P{A*B)=P{A\B)P{B) (12) 

similarly, 

P(A*B)=P(B\A)P(A) (13) 

Combining these two equations yields Bayes’ law: 


P(A\B) = 


P(B | A)P{A) 

W) 


( 14 ) 


In the equation above, A denotes a particular fault and B 
is a fault indication event or alarm that has occurred. P(B) 
is the probability of event B occurring, and P(B|A) is the 
probability of B occurring given that A has already occurred. 
For example, given that a sensor has failed (event A), P(B|A) 
is the probability that the signal’s fault detector will alarm 
(event B). P(B) is referred to as an a priori probability and 
P(B|A) is referred to as a conditional probability. If these 
probabilities are known, then we can calculate the probabil- 
ity of A occurring given that B has occurred, P(A|B). 

In general, the indication that an event has occurred has 
multiple pieces of evidence. In that case, it can also be 
shown that if B is a set of events, B, then 


where a is the normalizing constant. 

When there are multiple levels to a belief net, we apply 
this same formula recursively to the parents, as: 


P(Xi,X 2 , ... ,*„) = n p(xi i n*,) 


( 16 ) 


where nx, is the parent set of x f . 

The theory underlying the formulation and use of belief 
networks and related techniques for probabilistic reasoning 
is detailed in Judea Pearl, “Probabilistic Reasoning in Intel- 
ligent Systems: Networks of Plausible Inference,” Morgan 
Kaufmann, 1988, the disclosure of which is incorporated 
herein by reference. 

Limitations of Prior Fault Classification Procedures 

Expert systems, belief networks and related techniques 
for probabilistic reasoning have been of limited practical use 
in the field of asset surveillance and equipment condition 
monitoring. While many authors and practitioners suggest 
their value, deployments are limited to cases where the asset 
behavior is steady and predictable. The majority of such 
deployments use rule-based expert systems having simple 
backward chaining schemes to classify faults on the basis of 
a limited number of observable data features. 

Gross suggests the value of an AI (artificial intelligence) 
diagnostic/prognostic system in U.S. Pat. No. 5,764,509 and 
again in U.S. Pat. No. 6,181,975. However, Gross does not 
report a reduction to practice or state claims for such an AI 
diagnostic/prognostic technique despite the apparent desir- 
ability of doing so. Moreover, Gross does not conceive of 
the novelty described infra for operating mode partitioning 
of an AI diagnostic/prognostic system. 

Takeuchi describes a rule -based diagnostic/prognostic 
system in U.S. Pat. No. 5,009,833. While Takeuchi claims 
operation of the rule-base on either observed or simulated 
data, he does not conceive of the novelty of comparing the 
observed signals to estimated signals for the purpose of fault 
detection nor does he conceive of the novelty of using the 



US 6,917,839 B2 


27 

fault detection results as the observable inputs to his expert 
system. Moreover, Takeuchi does not conceive of the nov- 
elty described infra for operating mode partitioning of the 
rule-based diagnostic/prognostic system. 

Gross describes a rule-based diagnostic system in U.S. 5 
Pat. Nos. 5,223,207, 5,459,675 and 5,761,090. Gross 
describes a SPRT fault detection means that operates on the 
residuals formed from redundant physical sensor signals. 
The SPRT fault detection means provides the observable 
inputs to his expert system. However, Gross does not 10 
conceive of the novelty of comparing the observed signals to 
estimated signals for the purpose of fault detection. 
Moreover, Gross does not conceive of the novelty described 
infra for operating mode partitioning of the fault detection 
means or of the rule-based diagnostic system 15 

In U.S. Pat. No. 5,274,572, O’Neill describes a black- 
board expert system for asset surveillance but does not 
conceive of the novelty described infra for operating mode 
partitioning of the blackboard expert system. In U.S. Pat. 

No. 5,392,320, Chao describes a rule-based expert system 20 
for asset surveillance but does not conceive of the novelty 
described infra for operating mode partitioning of the rule- 
based expert system. In U.S. Pat. No. 5,402,521, Niida 
describes a neural network system for asset surveillance but 
does not conceive of the novelty described infra for oper- 25 
ating mode partitioning of the neural network system. In 
U.S. Pat. No. 5,465,321, Smyth describes a hidden Markov 
model system for asset surveillance but does not conceive of 
the novelty described infra for operating mode partitioning 
of the hidden Markov model. In U.S. Pat. No. 5,680,409, 30 
Qin describes a principal component analysis system for 
asset surveillance but does not conceive of the novelty 
described infra for operating mode partitioning of the prin- 
cipal component analysis model. 

Heger describes a Bayesian belief network (BBN) system 35 
for asset surveillance in U.S. Pat. No. 6,415,276 but does not 
conceive of the novelty described infra for operating mode 
partitioning. U.S. Pat. No. 6,415,276 provides a discussion 
of the use of belief networks for fault classification and is 
incorporated in its entirety herein by reference. 40 

Novel Improvements to the BBN Fault Classification 
Procedure 

Each of the fault classification procedures described in the 
section above might be improved by the novelty described 
infra for operating mode partitioning. Fault classification is 45 
improved by operating mode partitioning because operating 
mode partitioning allows a unique and possibly different set 
of faults to be classified within each operating mode and 
further allows the signature and probability of each fault 
type to be set differently within each operating mode. For 50 
example, faults having low probability of occurrence in one 
operating mode and a high probability of occurrence in 
another operating mode are more effectively classified sepa- 
rately. Further, the fault indication patterns that are indica- 
tive of an unacceptable asset condition might be set differ- 55 
ently in one operating mode than in another (see for 
example, FIG. 7). 

It is noteworthy that none of the practitioners enumerated 
above conceive of a method or system consisting of a unique 
combination of the following three elements: 60 

1. A parameter estimation method and system for producing 
a current estimate of signal data values correlative to an 
observation of signal data values acquired from an asset; 

2. A fault detection method and system for comparing one or 
more of the current estimates of signal data values to the 65 
corresponding observed signal data values acquired from 

an asset to produce fault indications; and 
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3. A fault classification method and system for determining 
the presence, cause and/or explanation of an unacceptable 
asset status or condition based on the fault indication 
results. 

Moreover, it is noteworthy that none of these practitioners 
conceives of any method or system that further includes the 
fourth element as follows: 

4. An operating mode partitioning method and system for 
providing more accurate and efficient performance of any 
of the three elements enumerated above by partitioning 
the characteristics of at least one of the three elements 
according to the expected operating modes of the asset. 
Neural Network Method and System for Determining 

Operating Mode of Asset: 

A method to determine the operating mode of the asset 12 
is required for both the training procedure 20 and the 
surveillance procedure 60 using an operating mode parti- 
tioned decision model 50. For each new data observation, 
the operating mode determination procedure 26 must clas- 
sify the observation as belonging to exactly one of a plurality 
of defined operating modes thereby allowing the required 
decision submodel to be selected for training or surveillance. 
The operating mode determination procedure 26 may use 
any form of algorithm that can determine the current oper- 
ating mode of the asset 12 based on one or more data 
observations from the asset. The specific implementation or 
type of the operating mode determination procedure 26 does 
not affect or modify the operation of the instant invention. 

In an embodiment of the invention, a Learning Vector 
Quantization (LVQ) neural network is used for the operating 
mode determination procedure 26. The LVQ neural network 
procedure is generally applicable to a wide range of assets. 
An LVQ neural network model is created for a specific asset 
using conventional neural network training algorithms to 
learn the inherent operating mode relationships within a set 
of historical process operating data. The trained LVQ model 
is then used to perform the operating mode determination 
procedure when presented with each new data observation. 
Because the LVQ neural network is trained by pattern 
matching a vector of observations from historical data, this 
type of neural network will always determine the most 
similar operating mode when presented with a new data 
observation. 

An LVQ neural network is a two -layer, pattern classifi- 
cation neural network in which each output node represents 
a particular class or category. 

FIG. 16 illustrates the architecture of an LVQ neural 
network. An LVQ network is one of a group of related 
pattern classification neural network models that can be used 
to cluster a set of s-element input vectors {X}={x 1 , . . . , 
X;, . . . , x s } into t clusters. The input nodes of the neural 
network draw data either directly from sensor signals or 
from the output of a mathematical function applied to one or 
more sensor signals. An input vector is defined as the set of 
data values, one value for each input node that is derived 
from the sensor signals at a given moment in time. The 
output nodes of the network correspond to one of the classes 
(herein, the operating modes) recognized by the neural 
network. During operation of the neural network, an input 
vector is presented to the network, passes through the 
network, and activates one of the t output nodes (y 1? . . . , 
Yj , . . . , y r ). Each of the output nodes corresponds to one of 
the classes recognized by the neural network. The LVQ 
neural network returns the class corresponding to the acti- 
vated output node, thereby determining the current operating 
mode of the asset. 

The input nodes are connected to the output nodes by a set 
of connection weights. The subset of connection weights 
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that connect all of the input nodes to one of the output nodes 
is called a weight vector. For example, output node y ■ is 
connected to the input nodes by weight vector 
{W / }={w l j -, . . . , w £ ., . . . , w ■}. An LVQ neural network 
that contains s input nodes and t output nodes would contain 5 
a total of t weight vectors, with each weight vector contain- 
ing s connection weights. 

An LVQ neural network is designed to recognize a 
predefined set of classes. Each one of the classes corre- 
sponds to a distinct operating mode of the asset under 
surveillance. During training of an LVQ neural network, the 
designer decides how many output nodes will be used to 
model each of the operating modes classified by the net- 
work. More than one output node can be used to represent 
a class (operating mode) recognized by the neural network. 

By using more than one node to represent a class, the 15 
number of neural network connection weights dedicated to 
that class is increased. This improves the ability of the neural 
network to recognize an operating mode of the asset. For 
each of the r classes, the designer specifies the number of 
output nodes that will model that class. 20 

A supervised training scheme is used for training an LVQ 
neural network. In this scheme, training is accomplished by 
presenting a sequence of matched pairs of input vectors and 
target vectors to the neural network, causing some of the 
network’s connection weights to be adjusted with each 2 s 
presentation of a training pair. The target vector 
{T}={t 1 , . . . , t £ , . . . , t f } is a set of binary values, one value 
for each output node in the network. An element of a target 
vector has a value of one if the corresponding output node 
represents the correct class for the input vector. Conversely, 
an element of a target vector has a value of zero if the 
corresponding output node represents an incorrect class for 
the input vector. 

For each training pair presented to the LVQ network, the 
Euclidean distance between the input vector and each of the ^ 
weight vectors is calculated. The Euclidean distances are 
then ordered, from smallest to largest. Only the weight 
vectors that produce the smallest two distances in the 
ordered sequence are allowed to learn. This form of learning 
is called competition, because only those weight vectors that 
produce the best scores (i.e., producing the minim um 
Euclidean distances) are modified during an iteration of the 
training algorithm. Three commonly used learning methods 
for training an LVQ neural network are herein designated 
LVQ1, LVQ2.1, and LVQ3. 45 

In the first learning method (LVQ1), only the weight 
vector that is closest to the current input vector (i.e., the 
weight vector that produces the minimum Euclidean 
distance) is allowed to learn. For each matched pair of input 
and training vectors presented to an LVQ network during 
training, the Euclidean distance between the input vector 
and each of the weight vectors is calculated and the output 
node connected to the weight vector that produces the 
minimum Euclidean distance is identified. If the output node 
that produces the minimum Euclidean distance corresponds 
to the correct operating mode, the connection weights for the 
output vector are positively reinforced as follows. Let the 
subscript j represent the output node whose weight vector 
produces the minimum Euclidean distance. If the target 
value for that output node is 1 (i.e., t y =l), then the weight 
vector for the output node (Wy) is updated by 

W r % + k(X-Wj) (17) 

where X is the current input vector and X is a scalar 
parameter called the learning rate that varies from 0 to 1. If 65 
the output node whose weight vector produces the minimum 
Euclidean distance corresponds to the incorrect operating 


mode (i.e., ty=0), the connection weights for the output 
vector are negatively reinforced by 

w y =w r x(x-w y ) ( 18 ) 

In the second (LVQ2.1) and third (LVQ3) learning 
methods, the two weight vectors that are closest to the 
current input vector are identified. These two weight vectors 
may be positively or negatively reinforced depending upon 
a number of conditions. The most important of these con- 
ditions is that the two weight vectors are modified only if 
they are roughly equidistant from the input vector. A user- 
defined control parameter (e), called the window size, is 
used to determine whether or not the two weight vectors are 
of comparable distances from the input vector. The window 
condition test that must be satisfied by the two closest weight 
vectors is that the ratio of distance between the closest 
weight vector and the input vector (d cl ) to the distance 
between the second closest weight vector and the input 
vector (d c2 ) must fall within the window. Namely, 


d c 1 d c 7 

— — > 1 - e and — < 1 + s 
d c2 d c j 


(19) 


The window size is a small user-defined constant with 
typical values in the range 0.1<e<0.5. 

In the LVQ2.1 algorithm, a second condition that must be 
met is that one of the two closest weight vectors connects to 
an output node of the same class as the input vector. While 
at the same time, the other weight vector must connect to an 
output node of a class that differs from the class of the input 
vector. If both the window and class conditions are met, then 
the weight vector whose output node belongs to the same 
class as the input vector is positively reinforced according to 
equation (17). Also, the weight vector whose output node 
belongs to a class that differs from that of the input vector 
is negatively reinforced according to equation (18). 

In the LVQ3 algorithm, the two weight vectors that are 
closest to the input vector are allowed to learn as long as the 
same window and class conditions as in the LVQ2.1 algo- 
rithm are met. The LVQ3 algorithm contains an additional 
learning mode. If the two weight vectors that are closest to 
the input vector meet the window condition (i.e., the con- 
ditions in equation (19) are met), and if both weight vectors 
connect to output nodes that are of the same class as the 
input vector, then both weight vectors are positively rein- 
forced. Both weight vectors are updated by 

W=W+SA,(X-W) (20) 

where 8 is a user-defined parameter, called the LVQ3 
multiplier, that reduces the learning rate. The LVQ3 multi- 
plier is a small constant with typical values in the range 
0.1<8<0.5. 

The concept behind the LVQ2.1 and LVQ3 learning 
methods is that as the input vectors used for training are 
presented to the neural network, learning occurs only when 
an input vector is close to two of the weight vectors. In this 
case, the input vector is near the boundary between two 
weight vectors. Learning occurs in the LVQ2. 1 algorithm 
only if one of the weight vectors belongs to the same output 
class as the input vector and the other weight vector belongs 
to a different class. The weight vector belonging to the 
correct class is positively reinforced and the other vector is 
negatively reinforced. The LVQ3 algorithm contains the 
same conditions as the LVQ2.1 algorithm. But an additional 
condition in the LVQ3 algorithm allows the network to 
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learn, at a slower rate, if both weight vectors belong to the 
same class as the input vector. Over the course of the 
iterative training procedure, this technique works to sharply 
define the boundaries between the vector spaces recognized 
by each weight vector. 

Aset of input vectors and corresponding target vectors are 
used to train the LVQ neural network. The set of input and 
target vectors is presented to the network and the connection 
weights are adjusted depending upon the learning algorithm 
selected. Then, the learning rate parameter (X) is decreased 
by a small amount and the set of input and target vectors is 
passed through the network again. The cycle is repeated 
until the learning rate decreases to zero or until the error rate 
for the neural network converges. Each training cycle of data 
presentation and learning rate reduction is called an epoch. 
The maximum number of epochs (n ) to be performed by 
the training algorithm is a user-defined control parameter. 
The learning rate decreases linearly with epoch number, 
with the learning rate decreasing to zero when the maximum 
number of epochs is reached. The initial value of the 
learning rate (X 0 ) is a user-defined control parameter that, 
along with the maximum the number of epochs, determines 
the rate at which the learning rate is decreased. Specifically, 
the learning rate is decreased by a factor of n^/X 0 at the end 
of each epoch. 

During each training epoch, the error rate for the neural 
network is calculated. The error rate is defined to be the 
fraction of input vectors that are incorrectly classified by the 
neural network. An input vector is correctly classified if the 
weight vector that is closest to it connects to an output node 
of the same class as the input vector. As each input vector in 
the training set is passed through the LVQ neural network 
during a training epoch, the program notes if the input vector 
was correctly or incorrectly classified. The error rate is then 
given by the ratio of the number of incorrectly classified 
input vectors to the total number of input vectors in the 
training set. By keeping track of the error rate, the training 
algorithm can be halted as soon as the neural network stops 
learning. 

The learning methods devised for the LVQ neural network 
are fine-tuning procedures. Only slight modifications are 
made to the network weight vectors during any training 
epoch. Therefore to minimize the number of epochs needed 
to train the neural network, the initial values of the weight 
vectors must be chosen wisely. The simplest method of 
initializing the weight vectors is to randomly select t vectors 
from the set of input vectors used to train the neural network 
and use them as initial values for the weight vectors, where 
t is the number of output nodes in the network. Although this 
initialization method works, a better method, which in 
general reduces the number of epochs needed to adequately 
train the network is to use the K-means clustering algorithm 
to set the initial values of the weight vectors. The K-means 
clustering algorithm is a method that will divide a vector 
space into K clusters and identify the centers of each cluster. 
The K-means clustering algorithm can be used to divide the 
input vectors used to train the LVQ network into t clusters 
and use the centers of the clusters as the initial values for the 
weight vectors. 

The K-mean clustering algorithm is used to initialize the 
weight vectors as follows. For each of the r classes recog- 
nized by the network, the input vectors that belong to each 
class are identified and collected into r arrays. Next the 
output nodes that belong to each class are identified. By 
definition, the number of output nodes that belong to each 
class is given by the nodes-per-class vector (N c/flSS ). Then for 
each class, the K-means clustering algorithm is used to 
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cluster the input vectors that belong to the class into a 
number of clusters that equals the number of output nodes 
that belong to the class. For instance for class j, the K-means 
clustering algorithm is used to divide the input vectors into 
5 nouty clusters and to evaluate the centers of the clusters. The 
cluster centers for class j are used to initialize the weight 
vectors whose output nodes belong to the class. The 
K-means clustering algorithm evaluates cluster centers for 
the class by minimizing the Euclidean distances between 
10 each of the input vectors in the class and the cluster center 
nearest to each. Thus, each cluster center is the mean value 
of the group of input vectors in a cluster domain. The 
K-means clustering algorithm was found to improve the 
recall capabilities of the neural network over the random 
15 initialization scheme, at a minimal increase in the compu- 
tational cost of the training calculations. 

A trained LVQ neural network operates as follows. At a 
point in time, a current data observation is acquired from the 
asset 12 and an input vector is constructed. The Euclidean 
20 distance between the input vector and each of the weight 
vectors is calculated. The weight vector producing the 
minimum Euclidean distance is found and its corresponding 
output node is activated. The neural network declares the 
operating mode corresponding to the activated output node 
25 to be the current operating mode of the asset 12 under 
surveillance. 

In Use and In Operation Using A MSET Parameter 
Estimation Model and A Neural Network for Determining 
the Operating Mode of the Asset 
30 Operating mode partitioned decision processing was first 
reduced to practice by applicant in the performance of 
NASA Contract NAS4-99012. Testing performed under this 
contract conclusively demonstrated the reduction to practice 
for and unobvious benefits of the instant invention. The 
35 contract final report and new technology disclosure docu- 
ments by applicant, delivered to the United States Govern- 
ment under this contract and listed herein below, further 
describe one embodiment and its reduction to practice, the 
disclosure of which is incorporated in its entirety herein by 
40 reference. 

NASA SBIR Phase I Final Report, “System State Deter- 
mination for Real-Time Sensor Validation,” NASA Contract 
NAS4-99012, 12 Jun. 1999. Publication or disclosure 
restricted to US Government personnel for four years pur- 
45 suant to Code of Federal Regulations 48 CFR 52.227-20. 

New Technology Report for NASA Contract NAS4- 
99012, “Phase Partitioning the Multivariate State Estimation 
Technique (MSET) Process for Improved Parameter Esti- 
mation Performance and Processing Speed,” Expert 
50 Microsystems, Inc. Document Control Number 2000-4446, 
24 Jan. 2000. Publication or disclosure restricted to US 
Government personnel for four years pursuant to Code of 
Federal Regulations 48 CFR 52.227-20. 

New Technology Report for NASA Contract NAS4- 
55 99012, “System State Classification Using A Learning Vec- 
tor Quantization (LVQ) Neural Network,” Expert 
Microsystems, Inc. Document Control Number 2000-4447, 
24 Jan. 2000. Publication or disclosure restricted to US 
Government personnel for four years pursuant to Code of 
60 Federal Regulations 48 CFR 52.227-20. 

In the performance of NASA Contract NAS4-99012, a 
sensor validation software module was designed to validate 
seventeen (17) mission critical telemetry signals for the 
Space Shuttle Main Engine (SSME), as listed in FIG. 18. 
65 These signals were selected based on their importance for 
real-time telemetry monitoring of the three Space Shuttle 
Main Engines during vehicle ascent to orbit. The names 
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listed in FIG. 18 use standard SSME nomenclature. Data 
from ten nominal Space Shuttle flights, with flights and 
engine positions as listed in FIG. 19, were selected as the 
training data for the MSET submodels and LVQ neural 
network used in the performance of this work. 

A series of parametric studies were performed to deter- 
mine the LVQ neural network configuration and training 
constants that provide the best performance for SSME 
operating mode determination. The neural network configu- 
ration and training constants selected for applicant’s reduc- 
tion to practice are defined in FIG. 17. Ten SSME flight data 
sets, defined in FIG. 19, were used to train the neural 
network. The operating mode determination capability of 
the LVQ neural network was shown to be excellent with 
operating mode classification error rates of less than 2% 
observed in testing with additional SSME flight data sets 
that were not used for training the neural network. 
Specifically, FIG. 20 illustrates three versions of the sensor 
validation software module. The first sensor validation soft- 
ware module, herein denoted the PD module, was created by 
the methods of the instant invention with a process memory 
matrix (D) size of 150 vectors for each operating mode 
partitioned MSET submodel in the MSET decision model. 
The PD module’s MSET decision submodels were created 
using an LVQ neural network for the operating mode 
determination procedure. The second sensor validation soft- 
ware module, herein denoted the A150 module, was created 
by the unpartitioned MSET model creation procedure with 
a process memory matrix (D) size of 150 vectors used in the 
unpartitioned MSET model. This enabled a direct compari- 
son of surveillance performance between the operating 
mode partitioned (instant invention) and unpartitioned mod- 
els given a constant processing time. The third sensor 
validation software module, herein denoted the A300 
module, was created by the unpartitioned MSET model 
creation procedure with a process memory matrix (D) size of 
300 vectors used in the unpartitioned MSET model. The 
A3 00 module enabled improved surveillance performance 
for the unpartitioned MSET model case, albeit at the cost of 
greater processing time. 

FIG. 20 further lists the parameter estimation model and 
fault detector configurations used for feasibility testing. 

The operating mode partitioned sensor validation module 
(denoted PD) incorporated an MSET decision model parti- 
tioned into seven (7) modes representative of the primary 
operating modes of the SSME. The rules used for partition- 
ing the training data for the SSME operating modes are 
provided in FIG. 21. The two unpartitioned sensor validation 
modules (denoted A150 and A300) were prepared using 
exactly the same training data without the benefit of oper- 
ating mode partitioning. 

Hie Argonne National Laboratory System State Analyzer 
(SSA) type pattern recognition operator was used in all of 
the MSET models. The fault detection models were all based 
on the SPRT mean positive and mean negative test methods. 
SPRT is a statistically derived test statistic with an explicit, 
non-zero false alarm probability. For this reason, SPRT fault 
detectors are generally used in combination with a multi- 
cycle fault decision algorithm to filter out the possible 
one-cycle SPRT alarms. The fault decision procedure was 
configured using a four (4) out of seven (7) multi-cycle 
decision algorithm. This fault decision procedure will 
declare a sensor failure whenever any 4 of the last 7 
observation cycles produce any type of one-cycle SPRT 
fault detection alarm. 

Performance testing clearly demonstrated the feasibility 
and benefits of using the operating mode partitioned MSET 
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decision model for real-time sensor signal validation. Met- 
rics used to evaluate the test results included the following: 

Total One Cycle Alarm Count — This is a measure of the 
total number of SPRT fault detector generated alarms for a 
5 single simulation run. For nominal cases, this is expected to 
be a near zero number. For failure simulation cases, the 
number will be non-zero. This metric provides a measure of 
the overall performance of the fault detection procedure. 

Average Parameter Estimation Error Percentage — This is 
10 a measure of the globally averaged parameter estimation 
error. The global averaged error is the sum of the single 
cycle error for all sensors and data observations divided by 
the total number of sensors and data observations. This 
metric provides a measure of the overall performance of the 
15 parameter estimation procedure. 

Average One Cycle Processing Time — This is a measure 
of the globally averaged single cycle validation processing 
time. The one cycle processing time is the sum of the 
processing time for all validated data observations divided 
20 by the total number of validated data observations. The 
processing time is calculated as the elapsed time between the 
time of the test driver’s call to the sensor validation mod- 
ule’s surveillance procedure and the time that the surveil- 
lance procedure returns its results to the test driver. 

Time to Failure Detection (Failure Simulations Only) — 
This is a measure of the elapsed time between the first 
observation containing sensor failure data and the observa- 
tion for which the sensor validation module declares the 
3Q sensor failed. Time to fault detection depends on the diag- 
nostic capability of the sensor validation module, the time of 
failure occurrence and the nature and magnitude of the 
sensor failure. The data herein report the elapsed mission 
time between the initiation of a slow drift in the signal and 
35 the time that the drift failure was detected. For consistency, 
all test cases herein used a drift magnitude of 0.2% of the 
nominal, full power level value of the sensor signal applied 
per second of engine operating time. 

Signal Error at Failure Detection (Failure Simulations 
40 Only) — This is a measure of the total accumulated drift error 
in a sensor signal at the time of failure detection. The data 
reported herein normalize the error at the time of detection 
in terms of a percentage of the nominal, full power level 
value of the sensor signal. 

45 The results tabulated in FIGS. 22 through 27 demonstrate 
the very significant improvement in sensor validation per- 
formance achieved using the operating mode partitioned 
MSET decision model in accordance with the instant inven- 
tion. The operating mode partitioned MSET decision model 
50 provided better fault detection sensitivity, lower parameter 
estimation error, and much faster processing time in com- 
parison to the unpartitioned MSET models. The operating 
mode partitioned MSET decision model exhibited zero (0) 
false alarms and zero (0) missed alarms during all testing 
55 performed. The results tabulated in FIGS. 22 and 23 were 
generated using an LVQ neural network for the operating 
mode determination procedure. 

Two test series were performed for comparison of the 
operating mode partitioned sensor validation module to the 
60 unpartitioned modules. In the first series, an unpartitioned 
model with a process memory matrix of 300 vectors was 
constructed (denoted A300). The operating mode partitioned 
model (denoted PD) used a process memory matrix of 150 
vectors for each individual operating mode. When compared 
65 to the 300 vector unpartitioned model, the operating mode 
partitioned MSET decision model in accordance with the 
instant invention demonstrated: 
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34% reduction in parameter estimation error; 

73% reduction in per cycle processing time; 

73% reduction in time to detect a sensor signal drift; 

73% reduction in the total signal error at drift failure 
detection. 5 

In addition, the 300 vector unpartitioned model missed 
two subtle noise failures that were properly detected by the 
operating mode partitioned decision model in accordance 
with the instant invention. 

In the second series, the operating mode partitioned 10 
decision model was compared to an unpartitioned model of 
equivalent run-time speed. To accomplish this, an unparti- 
tioned model with a process memory matrix of 150 vectors 
was constructed (denoted A150). When compared to the 150 
vector unpartitioned model, the operating mode partitioned 15 
decision model in accordance with the instant invention 
demonstrated: 

42% reduction in parameter estimation error; 

Equivalent per cycle processing time; 

77% reduction in time to detect a sensor signal drift; 20 

76% reduction in the total signal error at drift failure 
detection. 

In addition, the 150 vector unpartitioned model produced 
two sensor failure false alarms and missed one noise failure 
in cases that were properly detected by the operating mode 25 
partitioned decision model in accordance with the instant 
invention. 

The operating mode partitioned decision model provides 
better fault detection sensitivity because the operating mode 
specific MSET decision submodels are better able to esti- 30 
mate the current value of each observed parameter. This 
capability of the operating mode partitioned decision model 
is demonstrated by the reduction achieved in the parameter 
estimation error. Reduced parameter estimation error allows 
the SPRT thresholds for the fault detection model to be set 35 
to lower values thereby making the fault detection model 
more sensitive to the early indications of sensor failure 
(fewer missed alarms). This phenomenon proportionally 
reduces the time to drift failure detection as illustrated by 
comparison of the results reported in FIG. 23 to the results 40 
reported in FIG. 25 and FIG. 27. 

Parameter estimation error may be traded off against 
processing time by increasing the number of vectors in the 
process memory matrix. As is evident by comparison of 
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FIG. 24 and FIG. 26, doubling the process memory matrix 
size increased the single cycle processing time by a factor of 
four (2 2 ). Operating mode partitioning provides an effec- 
tively larger process memory matrix without the concomi- 
tant penalty in processing time. For example, the operating 
mode partitioned SSME sensor validation module (PD) 
includes seven active operating modes with process memory 
matrices sized at 150 vectors per mode. This provides an 
effective process memory matrix size of 1050 vectors with 
processing speed equivalent to a process memory matrix 
containing 150 vectors. A single unpartitioned model of 
equivalent accuracy would be 49 (7 2 ) times slower than the 
operating mode partitioned decision model. 

Processing speed results demonstrated the real-time 
monitoring capability of the operating mode partitioned 
decision model. Single observation processing times of 
5-msec (200 samples/second) were demonstrated with the 
seventeen (17) sensor SSME sensor validation module run- 
ning on a 300-MHz Pentium II processor. It is reasonable to 
allocate between 2 and 50-msec per data cycle for sensor 
validation processing in SSME real-time control applica- 
tions. The results of this testing show these goals are only 
attainable with operating mode partitioning of the MSET 
model in accordance with the instant invention. The unob- 
vious benefits of the instant invention are therefore demon- 
strated by this reduction to practice. 

Alternate Embodiment and In Use and Operation Using A 
MSET Decision Model for Parameter Estimation and A 
Rule-Based Logic Sequence for Determining the Operating 
Mode of the Asset: 

In another embodiment, the same MSET decision model 
methods and procedures described hereinabove were used 
with a rule-based logic sequence for the operating mode 
determination procedure 26. A rule-based mode determina- 
tion procedure is generally specific to a single type of asset 
and may be implemented in a plurality of forms. A rule- 
based mode determination procedure may use expert system 
or procedural logic depending on the nature and complexity 
of the operating modes of the asset. In one embodiment 
herein, procedural logic representing the rules specified in 
FIG. 21 for determining the operating mode of the SSME 
was reduced to practice using C language procedural soft- 
ware as follows. 


Begin Source Code listing 

/* SSME operating mode determiner function */ 

/* Copyright 1999 by Expert Microsystems, Inc. */ 

/* All Rights Reserved */ 

#define START_COMMAND 33024.0 
#define SHUTDOWN COMMAND 35328.0 

#define COMMAND_ISSUED(COMVAL, DATUM) ((DATUM>(COMVAL - 1.0)) 

&& (DATUM <(COMVAL + 1.0))) 

enum SSME_modes SSME_mode_determiner (double *data, enum Boolean initialize) 

{ 

float pc; /* Combustion chamber pressure */ 

float vehcom; /* Vehicle command code */ 

float compc; /* Commanded chamber pressure */ 

static float last_PL=0.0; 

static int cycles_in_start=0; 

static float last_compc=0.0; 

static enum SSME_modes last_state=PREFIRE; 

if(initialize) { 

last_PL = 0.0; 
cycles_in_start = 0; 
last_compc = 0.0; 
last_state = PREFIRE; 
return PREFIRE; 
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-continued 


}; 

pc = data[PID63]; /* Chamber pressure is PID63 */ 

vehcom = data[PID280]; /* Vehicle command is PID280 */ 

compc = data[PlD287]; /* Commanded chamber pressure is PID287 */ 

/* Take care of special cases first...*/ 
if (last_state == PREFIRE) { 

if (COMM AND_IS SUED (START_COMMAND, vehcom)) { 

/* If we’re waiting for START and receive START, then we’re 
in TRANSIENT. */ 
last_state = START01; 
return START01; 

} else { 

/* Keep waiting. */ 
last_state = PREFIRE; 
return PREFIRE; 

} 

} else if (last state == SHUTDOWN || COMMAND_ISSUED 

(SHUTDOWN_COMMAND, vehcom)) { 

/* Once SHUTDOWN is detected, stay in SHUTDOWN until re- initialized. */ 

last state = SHUTDOWN; 

return SHUTDOWN; 

} 

if(last_state==START01) { 

if(++cycles_in_start<25) { /* 0 to 1.0 sec */ 
last_compc = compc; 
last_state = START01; 
return START01; 

} else { 

last_state = START12; 
return START12; 

} 

}; 

if (last state==STARTl 2) { 

if(++cycles in start<50) { /* 1.0 to 2.0 sec*/ 

last_compc = compc; 
last_state = START12; 
return START12; 

} else { 

last_state = START24; 
return START24; 

}; 

}; 

if(last state==START24) { 

if (++cycles in start<25 * 4) { /* 2.0 to 4.00 sec minimum */ 

last_compc = compc; 
last_state = START24; 
return START24; 

}; 

}; 

/* ELSE... mainstage operation. */ 

if((last state==STEADY LOW || last_state==STEADY_FULL) 

&&fabs(compc-last_compc)<3.35) { 
last_PL = pc; 
last_compc = compc; 
if(compc <2500.0) { 

last state = STEAD Y_LOW; 

return STEADY LOW 
} else { 

last_state = STEADY_FULL; 
return STEAD Y_FULL 

} 

} else {/* In transient */ 

if(fabs (compc - pc) <= (5 * 3.35)) { 

/* Transition to steady-state. */ 
last_PL = pc; 
last_compc = compc; 
if(compc <2500.0) { 

last_state STEAD Y_LOW; 
return STE AD Y_LO W ; 

} else { 

last_state = STEAD Y_FULL; 
return STEAD Y_FULL 

} 

} else if(last_state==START24) { 
last_PL = pc; 
last_compc = compc; 
return last__state 

} else if(compc>last_compc || pc<compc) { 
last_PL = pc; 



39 


US 6,917,839 B2 


40 


-continued 


last_compc = compc; 
last_state = UPTHRUST; 
return UPTHRUST; 

} else { 

last_PL = pc; 
last_compc = compc; 
last_state = DOWNTHRUST; 
return DOWNTHRUST; 

}; 

}; 

} 

End Source Code Listing 


Reduction to practice and performance testing was 
accomplished using the MSET parameter estimation tech- 
niques and rule-based operating mode determination proce- 
dure described hereinabove. Substantially identical test 
results were achieved using the rule-based method and the 
LVQ neural network method for the operating mode deter- 
mination procedure 12. This was expected because both 
methods implemented the same operating mode determina- 
tion criteria, as defined in FIG. 21, albeit using very different 
means. Reduction to practice using both neural network and 
rule-based methods illustrates that the instant invention may 
employ any one of a plurality of operating mode determi- 
nation procedures 26 to achieve the benefits described 
herein. These techniques were also demonstrated using 
mathematical models, as shown in FIG. 28. 

In Use and In Operation Using A Bayesian Belief Net- 
work for Classifying the Condition of an Asset and A 
Rule -Based Logic Sequence for Determining the Operating 
Mode of the Asset 

In one application, the novel surveillance method and 
system of the instant invention that combines methods for 
parameter estimation, fault detection, and fault classification 
was used for determining the status of instrument and 
system assets in smart, autonomous sensors and control 
components developed for the X-33 Single Stage to Orbit 
Demonstrator Hydrogen Detection System. The instant 
invention was demonstrated by applicant in the performance 
of this work under NASA Contract NAS13-01001. Testing 
performed under this contract conclusively demonstrated the 
reduction to practice for and unobvious benefits of the 
instant invention. The contract final report and new tech- 
nology disclosure documents by applicant, delivered to the 
United States Government under this contract and listed 
herein below, further describe one preferred embodiment 
and its reduction to practice, the disclosure of which is 
incorporated in its entirety herein by reference. 

NASA SBIR Phase I Final Report, “Autonomous Control 
System Components,” NASA Contract NAS 13-01 001, 
November 2001. Publication or disclosure restricted to US 
Government personnel for four years pursuant to Code of 
Federal Regulations 48 CFR 52.227-20. 

New Technology Report for NASA Contract NAS13- 
01001, “A Surveillance System and Method having Proba- 
bilistic Fault Detection and Classification,” Expert 
Microsystems, Inc. Document Control Number 2001-4473, 
November 2001. Publication or disclosure restricted to US 
Government personnel for four years pursuant to Code of 
Federal Regulations 48 CFR 52.227-20. 

In the performance of NASA Contract NAS 13-0 1001, a 
test setup was configured for exposing two redundant hydro- 
gen sensor assemblies to varying concentrations of H 2 gas. 
Each of these sensor assemblies contains two H 2 sensor 
elements (designated LR and HR), one temperature sensor 


element (designated T), and one heater control element. The 
test system computer 44 used the data acquisition means 40 
to monitor the signal sources 42 consisting of the three 
sensor elements on each of the two sensors as shown in FIG. 
29 and actuated the alarm 83 if hydrogen levels exceeded 
safety limits. The decision model 50 was comprised of an 
MSET parameter estimation model 52, a SPRT fault detec- 
tion model 54, and a BBN fault classification model 56. The 
MSET parameter estimation model and SPRT fault detection 
models were used to provide alarm indications for signals 
25 that were behaving abnormally. As each observation was 
analyzed, the SPRT returned an array of ones and zeros 
indicating whether each signal’s reading was normal or 
abnormal. The Bayesian belief network (BBN) used the 
SPRT output as positive findings to determine the state for 
30 each of its alarm indication leaf nodes. The BBN used this 
state information to determine the probability of any of the 
specified possible causes of the fault, thereby classifying the 
fault. 

BBN Configuration 

35 The BBN applies the output from each fault detector 
associated with one of its alarm indication leaf nodes as a 
positive finding for the leaf node. The BBN combines this 
information from all of its leaf nodes to determine the 
probability of any of the specified possible causes of the 
fault. The BBN fault classification submodel 57 configura- 
40 tion and probability tables can be different in each operating 
mode. However, in this case a similar BBN fault classifica- 
tion submodel configuration was used in each of the two 
system operating modes. The configuration was as follows. 

Six fault nodes were created to determine the specific 
45 sensor element that had failed. These were designated HR1 
element, LR1 element, T1 element, HR2 element, LR2 
element, and T2 element. Each sensor element node had two 
possible states, good (not faulty) and bad (faulty). Two 
additional fault nodes were created to determine whether the 
50 entire sensor had failed. These were named Sensor 1 and 
Sensor2. They also took on the states good and bad. If a node 
is in a good state, the implication is that the associated asset 
item is operating acceptably. For each of the fault nodes, we 
assign a prior probability. This is the probability that the 
55 proposition is true. For example, we believe that Sensorl 
and Sensor2 are reliable under the test conditions, so we 
believe they will behave correctly 95% of the time. This 
implies that they will behave incorrectly 5% of the time. 
This may be due to any number of reasons, e.g., not 
60 supplying power to the sensor, damage to the sensor’s 
cables, etc. The reasons for this failure are not of interest, 
only that each sensor can be expected to operate correctly 
95% of the time. We believe the probability of a bad element 
to be greater that the probability of a bad sensor, so we 
65 assigned a 10% prior probability to each of the elements 
failing. Conversely, there is 90% prior probability that each 
element has not failed. 
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Each of the alarm indication nodes is a leaf node. Each 
leaf node is a child of at least one parent, and a conditional 
probability is required for each of the combinations of parent 
states. That is, given the state of each of the parents, there is 
a probability that the alarm leaf node will be in either the 
“normal” or “abnormal” state. Each fault node indicating a 
bad element has an associated alarm leaf node as a child. 
Each “bad sensor” node has three of the leaf nodes as 
children. Therefore, each leaf node has two parents. The 
conditional probability tables require one entry for each state 
of the node for each combination of the node’s parent’s 
states. This yields a conditional probability table for each 
node that requires 8 separate entries. 

Conditional probabilities for the leaf nodes were divided 
into three conditions. If all of the parent states were “good”, 
the probability that each of the child nodes were in a normal 
state was very high (95%). If either the associated sensor or 
the element were “bad”, the probability that the sensor was 
behaving abnormally was also high (95%). If both the sensor 
and the element were “bad” the probability that the sensor 
was behaving abnormally was higher than if only one or the 
other were “bad”, so a 99% probability of failure was 
assigned. 

The MSET/SPRT submodel combination provides one 
example of a means to perform parameter estimation and 
fault detection in order to determine the state of the alarm 
indication leaf nodes. The fault detectors each return a 
normal or abnormal value for their associated alarm indica- 
tion nodes every time an observation is processed. Each of 
these values is treated as a positive finding on the associated 
alarm node. A positive finding is a value that may be applied 
with 100% certainty, or a probability of 1. That is, we are 
positive that this alarm node is reporting a “normal” or 
“abnormal” condition. 

The BBN fault classification submodel is diagrammed in 
FIG. 30. The complete set of prior probability and threshold 
values for the BBN nodes are listed in FIG. 31 for Sensor 1 
and FIG. 32 for Sensor2. The threshold values simply define 
the posterior probability value above which the node will be 
considered to be in an unacceptable condition. When the 
posterior probability value for the node exceeds the 
threshold, the BBN will classify the node and thereby the 
associated asset item as faulty. 

Test Procedure and Results 

The system was exposed to H 2 at 0%, 0.1%, 0.5%, 1.0%, 
5.0%, 10.0% and 100.0% concentrations during normal 
operation. Training data was collected during these expo- 
sures from each of these elements at one -second intervals. 
Test data was taken several months later at 3.57% H 2 and 0% 
H 2 using the same sensors and test configuration. Next, these 
data were adjusted to create additional sets of training and 
test data wherein the effect of hydrogen tank venting in the 
vicinity of the sensors was simulated. Tank venting has the 
effect of increasing the background hydrogen concentration 
in the vicinity of the sensors and can cause the hydrogen 
detection system to produce undesirable false alarms or 
missed alarms. 

An operating mode determination procedure 26 was used 
to classify each observation on the basis of the tank vent 
valve state. The procedure classified the operating mode as 
OPERATING whenever the tank valve indication was less 
than 50% open and the operating mode as VENTING 
whenever the tank valve indication was more than 50% 
open. 

Two MSET parameter estimation submodels 53 were 
trained on the training data, one for OPERATING and one 
for VENTING. Two SPRT fault detection submodels 55 
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were calibrated for each included sensor on the basis of their 
corresponding MSET submodel estimates taken over the 
training data for their respective operating modes. Each fault 
detector returned an array of ones and zeros indicating 
5 whether each sensor element’s reading was normal or abnor- 
mal. 

Test results demonstrated that the BBN is effective for 
diagnosing faults detected by the parameter estimation and 
fault detection procedures. Nominal (OPERATING) test 
10 data consisted of data obtained during exposure to 3.57% 
H 2 , and during exposure to 0% H 2 using the sensors that had 
been previously used to acquire the training data. As 
expected, the system accurately predicted the data and 
generated no alarms. 

15 In order to further test the operating mode partitioned 
BBN fault classification model, we overlaid drift errors on 
selected signal data, forcing the MSET/SPRT algorithms to 
generate fault indications. These simulated faults and their 
resulting fault classification probabilities are summarized in 
20 FIGS. 33 through 34. Beliefs (fault classification 
probabilities) are listed with the most probable causes at the 
top of the list. In each case, the operating mode partitioned 
BBN fault classification model correctly diagnosed the 
cause of the fault indications. The OPERATING mode test 
25 data in FIG. 33 illustrates that when faults occur on three of 
the sensor’s elements, the most likely cause of failure is the 
failure of the entire sensor. When one or two of the sensor’s 
elements are abnormal, the most likely cause is element 
failure. As expected, when more than one element fails on a 
30 sensor, the probability of the sensor having failed is 
increased, but a much larger increase occurs with the third 
element failing. The VENTING mode test data in FIG. 34 
illustrates the same results. 

For comparison, a hydrogen detection system model was 
35 configured and run over the same OPERATING and VENT- 
ING test data, but without the benefit of the operating mode 
determination procedure 26. That is, the decision model was 
trained and operated as a single mode unpartitioned model. 
In this case, performance was satisfactory for the OPERAT- 
40 ING test data, which yielded the same decision results as the 
partitioned model as shown in FIG. 35. However, the 
unpartitioned model failed to correctly classify all of the 
simulated faults when processing the VENTING test data as 
shown in FIG. 36. In the VENTING test cases, the drift error 
45 in signal LI is not detected. This leads to a missed alarm for 
LI when the only fault occurs on LI and to an incorrect 
diagnosis of the failure of sensor 1 when HI, LI, and T1 
simultaneously drift high. 

These comparative tests demonstrate the improvement in 
50 decision accuracy that results from the unique methods of 
the subject invention. 

While an MSET parameter estimation procedure is 
described herein above, any type of parameter estimation 
procedure can be used with the instant invention. The 
55 novelty described infra is not a modification or improvement 
to the MSET procedure, but is rather a new means of using 
any existing parameter estimation procedure so as to more 
effectively accomplish the fault classification objective. 

While a SPRT fault detection procedure is described 
60 herein above, any type of fault detection procedure can be 
used with the instant invention. The novelty described infra 
is not a modification or improvement to the SPRT procedure, 
but is rather a new means of using any existing fault 
detection procedure so as to more effectively accomplish the 
65 fault classification objective. 

While a BBN fault classification procedure is described 
herein above, any type of fault classification procedure can 
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be used with the instant invention. The novelty described 
infra is not a modification or improvement to the BBN 
procedure, but is rather a new means of using any existing 
fault classification procedure so as to more effectively 
accomplish the fault classification objective. 

Accordingly, in one aspect the present invention provides 
a surveillance system and method having fault classification 
and operating mode partitioning. 

In another aspect the present invention provides a system 
and method for performing high sensitivity surveillance of a 
wide variety of assets including industrial, utility, business, 
medical, transportation, financial, and biological processes 
and apparatuses wherein such process and/or apparatus asset 
preferably has at least two distinct modes of operation. 

In another aspect the present invention provides a system 
and method for determining the status of an asset. 

In another aspect the present invention provides a system 
and method for performing control of an asset. 

In another aspect the present invention provides a system 
and method which partitions a parameter estimation model 
for a process surveillance scheme into two or more coordi- 
nated submodels each providing improved parameter esti- 
mation for a single operating mode or related subset of 
operating modes of the process. 

In another aspect the present invention provides a system 
and method which partitions a fault detection model for a 
process surveillance scheme into two or more coordinated 
submodels each providing improved fault detection for a 
single operating mode or related subset of operating modes 
of the process. 

In another aspect the present invention provides a system 
and method which partitions a fault classification model for 
a process surveillance scheme into two or more coordinated 
submodels each providing improved fault classification for 
a single operating mode or related subset of operating modes 
of the process. 

In another aspect the present invention provides a system 
and method which partitions a parameter estimation model, 
a fault detection model, and a fault classification model for 
a process surveillance scheme into two or more coordinated 
submodels together providing improved diagnostic decision 
making for a single operating mode or related subset of 
operating modes of the process. 

In another aspect the present invention provides a system 
and method which creates an improved parameter estimation 
model for a process surveillance scheme using recorded 
operating data for an asset to train a parameter estimation 
model. 

In another aspect the present invention provides a system 
and method as characterized above which provides an 
improved system and method for surveillance of signal 
sources, detecting a fault or error state of the signal sources, 
and determining the cause of the fault or error state of the 
signal sources enabling responsive action thereto. 

In another aspect the present invention provides the 
system and method which provides an improved system and 
method for surveillance of on-line, real-time signals, or 
off-line accumulated signal data. 

In another aspect the present invention provides a system 
and method for generating an improved virtual signal esti- 
mate for at least one process parameter given an observation 
of at least one actual signal from the asset. 

In another aspect the present invention provides a system 
and method as characterized above which provides an 
improved system and method for ultra-sensitive analysis and 
modification of asset processes and apparatuses using at 
least one parameter estimation technique for the generation 
of at least one virtual signal parameter. 
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In another aspect the present invention provides a system 
and method as characterized above which provides an 
improved system and method for ultra-sensitive analysis and 
modification of asset processes and apparatuses using at 
5 least one fault detection technique for comparing at least one 
virtual signal parameter to at least one observed signal 
parameter. 

In another aspect the present invention provides a system 
and method as characterized above which provides an 
10 improved system and method for ultra-sensitive analysis and 
modification of asset processes and apparatuses using at 
least one fault detection technique for assessing at least one 
observed signal parameter. 

In another aspect the present invention provides a system 
15 and method as characterized above which provides an 
improved system and method for ultra-sensitive analysis and 
modification of asset processes and apparatuses using at 
least one diagnostic decision making technique for assessing 
the status of the asset using at least one observed signal 
20 parameter. 

In another aspect the present invention provides a system 
and method as characterized above which provides an 
improved system and method for ultra-sensitive analysis and 
modification of asset processes and apparatuses using at 
25 least one diagnostic decision making technique for assessing 
the status of the asset using at least one virtual signal 
parameter. 

In another aspect the present invention provides a system 
and method as characterized above which provides an 
30 improved system and method for ultra-sensitive analysis and 
modification of asset processes and apparatuses wherein the 
diagnostic decision technique used for assessing the status of 
the asset is a Bayesian network. 

In another aspect the present invention provides a system 
35 and method as characterized above which provides an 
improved system and method for ultra-sensitive analysis and 
modification of asset processes and apparatuses wherein the 
diagnostic decision technique used for assessing the status of 
the asset is an expert system or other rule based system. 

40 In another aspect the present invention provides a system 
and method as characterized above which provides an 
improved system and method for ultra-sensitive analysis and 
modification of asset processes and apparatuses wherein the 
diagnostic decision technique used for assessing the status of 
45 the asset is a neural network. 

In another aspect the present invention provides a system 
and method to classify the operating mode of an asset 
wherein the classification is performed using an expert 
system having any one of a plurality of structures, training 
50 procedures, and operating procedures. 

In another aspect, the present invention provides a system 
and method to classify the operating mode of an asset 
wherein the classification is performed using a neural net- 
work having any one of a plurality of structures, training 
55 procedures, and operating procedures. 

In another embodiment of the invention, an asset surveil- 
lance system is comprised of: an operating mode partitioned 
fault classification model 56 of an asset 12 comprised of a 
plurality of fault classification submodels 57 each having an 
60 asset operating mode M,- associated thereto; a fault indica- 
tion means 70 for determining one or more fault indications 
given a set of observed asset signals from the asset 12 ; 
means for determining at least one operating mode M i of the 
asset 12 for the set of observed asset signals; a first selection 
65 means for selecting at least one of the fault classification 
submodels 57 from the operating mode partitioned fault 
classification model 56 as a function of at least the one 
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determined operating mode M t - for providing a fault classi- 
fication of determined fault indications for performing asset 
surveillance. The fault indication means further includes an 
operating mode parameter estimation model 52 comprised 
of a plurality of parameter estimation submodels 53 each 5 
having an asset operating mode M i associated thereto and a 
second selection means for selecting at least one of the 
parameter estimation submodels 53 from the operating mode 
partitioned parameter estimation model 52 as a function of 
at least the one determined operating mode M,. The fault 10 
indication means further includes means for processing the 
observed asset signals as a function of at least the one 
selected parameter estimation submodel for defining param- 
eter estimated data. Additionally, the fault indication means 
includes an operating mode partitioned fault detection model 15 
54 comprised of a plurality of fault detection submodels 55 
each having an asset operating mode M t - associated thereto. 
Furthermore, the fault indication means further includes a 
third selection means for selecting at least one of the fault 
detection submodels 55 from the operating mode partitioned 20 
fault detection model 54 as a function of at least the one 
determined operating mode M,-. Moreover, the fault indica- 
tion means further includes means for processing the 
observed asset signals as a function of at least the one 
selected fault detection submodel 55 for determining the one 
or more fault indications used for providing the fault clas- 
sification of determined fault indications by the first selec- 
tion means selecting at least one of the fault classification 
submodels 57 from the operating mode partitioned fault 
classification model 56 as a function of at least the one 
determined operating mode M ; for providing the fault clas- 
sification of determined fault indications for performing 
asset surveillance. 

In another embodiment of the invention, a method for 
determining asset status includes the steps of creating 31 an 
operating mode partitioned fault classification model 56 
comprised of a plurality of fault classification submodels 57 
each having an asset operating mode M f associated thereto; 
acquiring 62 a set of observed signal data values from an 
asset; determining 70 at least one fault indication as a 
function of the observed signal data values; determining 26 
at least one operating mode M, of the asset 12 for the set of 
observed asset signals; selecting 76 at least one fault clas- 
sification submodel 57 from the operating mode partitioned 
fault classification model 56 as a function of at least the one 
determined operating mode M„ and using at least the one 
fault indication and at least the one selected fault classifi- 
cation submodel 57 for classifying faults 76 for performing 
asset surveillance. 

In another embodiment of the invention, a method for 50 
determining asset status includes the steps of partitioning a 
decision model 50 into a plurality of partitions e.g., 52, 54, 

56, each partition having an operating mode M, associated 
thereto: employing a plurality of different methods 53, 55, 

57 from a plurality of parameter estimation methods 52, a 
plurality of fault detection methods 54, and a plurality of 
fault classification methods 56 for different partitions; deter- 
mining at least one operating mode M, of an asset 12; 
selecting at least one of the plurality of partitions as a 
function of the determined operating mode for tailoring the 
plurality of parameter estimation methods 52, the plurality 
of fault detection methods 54, and the plurality of fault 
classification methods 56 to asset surveillance as a function 
of the at least one determined operating mode M,. 

In another embodiment of the invention, a method for 
determining asset status includes the steps of acquiring a set 
of observed signal data values from an asset; producing a 
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calculated set of estimated signal data values correlative to 
the set of observed signal data values acquired from the 
asset; comparing the set of observed signal data values to the 
calculated set of estimated signal data values; determining a 
presence of a disagreement between the set of observed 
signal data values and the calculated set of estimated signal 
data values on the basis of the comparison step, and deter- 
mining a cause of a determined presence of disagreement 
between the set of observed signal data values and the 
calculated set of estimated signal data values for performing 
asset surveillance. The method further including the step of 
using a Bayesian Belief Network (BBN) fault classification 
method for determining a presence of an unacceptable asset 
status or fault condition on the basis of a disagreement 
between the set of observed signal data values and the 
calculated set of estimated signal data values derived from 
the comparison step. The method further including the step 
of performing asset control as a function of the classified 
asset status or fault condition. 

In another embodiment of the invention, a method for 
determining asset status includes the steps creating 30 a fault 
detection model 54 comprised of a plurality of fault detec- 
tion submodels 55 each having an operating mode M,- 
associated thereto; creating 31a fault classification 56 model 
25 comprised of a plurality of fault classification submodels 57 
each having an operating mode M, associated thereto; 
acquiring a set of observed signal data values from an asset 
12; determining at least one operating mode of the asset 12 
for the set of observed signal data values; selecting 64 at 
30 least one fault detection submodel from the fault classifica- 
tion model as a function of at least the one determined 
operating mode M,; determining 70 at least one fault indi- 
cation as a function of the observed signal data values; 
selecting 64 at least one fault classification submodel 57 
35 from the fault classification model 56 as a function of at least 
the one determined operating mode M„ and using at least the 
one fault indication and at least the one selected fault 
classification submodel 57 for classifying faults 76 for 
performing asset surveillance. The method further including 
40 the step of creating 29 a parameter estimation model 52 
comprised of a plurality of parameter estimation submodels 
53 each correlative to at least one training data subset 
partitioned from an unpartitioned training data set 24 and 
each having an operating mode M, associated thereto and 
45 wherein the step of determining 70 at least one fault indi- 
cation as a function of the observed signal data values 
includes the step of determining at least one fault indication 
as a function of both the estimated signal values and the 
observed signal data values. 

In another embodiment of the invention, a system for 
determining asset status is comprised of: a parameter esti- 
mation model 52 comprised of a plurality of parameter 
estimation submodels 53 each correlative to at least one 
training data subset partitioned from an unpartitioned train- 
55 ing data set 24 and each having an operating mode M, 
associated thereto; a fault detection model 54 comprised of 
a plurality of fault detection submodels 55 each having an 
operating mode M, associated thereto; a fault classification 
56 model comprised of a plurality of fault classification 
60 submodels 57 each having an operating mode M,- associated 
thereto; means for acquiring a set of observed signal data 
values from an asset 12; means for determining at least one 
operating mode of the asset 12 for the set of observed signal 
data values; means for selecting 64 at least one parameter 
65 estimation submodel 53 from the parameter estimation 
model 52 as a function of at least the one determined 
operating mode M,-; means for calculating a set of estimated 
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signal values from at least one selected parameter estimation 
submodel 53; means for selecting 64 at least one fault 
detection submodel from the fault classification model as a 
function of at least the one determined operating mode M-; 
means for determining 70 at least one fault indication as a 5 
function of both the estimated signal values and observed 
signal data values; means for selecting 64 at least one fault 
classification submodel 57 from the fault classification 
model 56 as a function of at least the one determined 
operating mode M,-, and means for using at least the one fault 
indication and at least the one selected fault classification 
submodel 57 for classifying faults 76 for performing asset 
surveillance. 

Moreover, having thus described the invention, it should 
be apparent that numerous structural modifications and 
adaptations may be resorted to without departing from the 
scope and fair meaning of the instant invention as set forth 
hereinabove and as described hereinbelow by the claims. 

I claim: 

1. An asset surveillance system, comprising in combina- 2Q 
tion: 

an operating mode partitioned fault classification model 
of an asset comprised of a plurality of fault classifica- 
tion submodels each having an asset operating mode 
associated thereto; 25 

a fault indication means for determining one or more fault 
indications given a set of observed asset signals from 
the asset; 

means for determining at least one operating mode of the 
asset for the set of observed asset signals; 30 

a first selection means for selecting at least one of the fault 
classification submodels from the operating mode par- 
titioned fault classification model as a function of at 
least the one determined operating mode for providing 
a fault classification of determined fault indications for 35 
performing asset surveillance. 

2. The system of claim 1 wherein said fault indication 

means further includes an operating mode partitioned 
parameter estimation model comprised of a plurality of 
parameter estimation submodels each having an asset oper- 40 
ating mode associated thereto and a second selection means 
for selecting at least one of the parameter estimation sub- 
models from the operating mode partitioned parameter esti- 
mation model as a function of at least the one determined 
operating mode. 45 

3. The system of claim 2 wherein said fault indication 

means further includes means for processing the observed 
asset signals as a function of at least the one selected 
parameter estimation submodel for defining parameter esti- 
mated data. 50 

4. The system of claim 3 wherein said fault indication 

means includes an operating mode partitioned fault detec- 
tion model comprised of a plurality of fault detection 
submodels each having an asset operating mode associated 
thereto. 55 

5. The system of claim 4 wherein said fault indication 

means further includes a third selection means for selecting 
at least one of the fault detection submodels from the 
operating mode partitioned fault detection model as a func- 
tion of at least the one determined operating mode. 60 

6. The system of claim 5 wherein said fault indication 
means further includes means for processing the parameter 
estimated data as a function of at least the one selected fault 
detection submodel for determining the one or more fault 
indications used for providing the fault classification of 65 
determined fault indications by said first selection means 
selecting at least one of the fault classification submodels 
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from the operating mode partitioned fault classification 
model as a function of at least the one determined operating 
mode for providing the fault classification of determined 
fault indications for performing asset surveillance. 

7. The system of claim 1 wherein the fault classification 
of determined fault indications predicts asset failures. 

8. The system of claim 7 wherein the fault classification 
of determined fault indications predicts specific asset fail- 
ures including one or more sensor failures. 

9. The system of claim 7 wherein the fault classification 
of determined fault indications predicts specific asset fail- 
ures including one or more equipment failures. 

10. The system of claim 7 wherein the fault classification 
of determined fault indications predicts specific asset fail- 
ures including an undesirable process operating condition. 

11. The system of claim 1 further including means for 
performing asset control as a function of the fault classifi- 
cation of determined fault indications. 

12. An asset surveillance method, the steps including: 
creating an operating mode partitioned fault classification 

model comprised of a plurality of fault classification 
submodels each having an asset operating mode asso- 
ciated thereto 

acquiring a set of observed signal data values from an 
asset; 

determining at least one fault indication as a function of 
the observed signal data values; 
determining at least one operating mode of the asset for 
the set of observed asset signals; 
selecting at least one fault classification submodel from 
the operating mode partitioned fault classification 
model as a function of at least the one determined 
operating mode, and 

using at least the one fault indication and at least the one 
selected fault classification submodel for classifying 
faults for performing asset surveillance. 

13. The method of claim 12 further including the step of 
performing asset control as a function of the classified faults. 

14. The method of claim 12 further including the step of 
predicting asset failures as a function of the classified faults. 

15. The method of claim 12 further including the step of 
predicting asset failures including one or more sensor fail- 
ures. 

16. The method of claim 12 further including the step of 
predicting asset failures including one or more equipment 
failures. 

17. The method of claim 12 further including the step of 
predicting asset failures including an undesirable process 
operating condition. 

18. An asset surveillance method, the steps including: 
partitioning a decision model into a plurality of partitions, 

each partition having an operating mode associated 
thereto: 

employing a plurality of different methods from a plural- 
ity of parameter estimation methods, a plurality of fault 
detection methods, and a plurality of fault classification 
methods for different partitions; 
determining at least one operating mode of an asset; 
selecting at least one the plurality of partitions as a 
function of the determined operating mode for tailoring 
the plurality of parameter estimation methods, the 
plurality of fault detection methods, and the plurality of 
fault classification methods for asset surveillance as a 
function of the at least one determined operating mode. 

19. The method of claim 18 further including the step of 
performing asset control as a function of the classified faults. 
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20. The method of claim 18 further including the step of 
predicting asset failures as a function of the classified faults. 

21. The method of claim 18 further including the step of 

predicting asset failures including one or more sensor fail- 
ures. 5 

22. The method of claim 18 further including the step of 
predicting asset failures including one or more equipment 
failures. 

23. The method of claim 18 further including the step of 
predicting asset failures including an undesirable process 10 
operating condition. 

24. An asset surveillance method, the steps including: 
acquiring a set of observed signal data values from an 

asset; 

producing a calculated set of estimated signal data values 15 
correlative to the set of observed signal data values 
acquired from the asset; 

comparing the set of observed signal data values to the 
calculated set of estimated signal data values; 2Q 

determining a presence of a disagreement between the set 
of observed signal data values and the calculated set of 
estimated signal data values on the basis of the com- 
parison step, and 

determining a cause of a determined presence of disagree- 25 
ment between the set of observed signal data values and 
the calculated set of estimated signal data values for 
performing asset surveillance. 

25. The method of claim 24 further including the step of 
using a Bayesian Belief Network (BBN) fault classification 30 
method for determining a cause of a disagreement between 
the set of observed signal data values and the calculated set 

of estimated signal data values on the basis of the compari- 
son step. 

26. The method of claim 24 further including the step of 35 
performing asset control as a function of the classified faults. 

27. An asset surveillance method, the steps including: a 
method for determining asset status includes the steps of: 

creating a fault detection model comprised of a plurality 
of fault detection submodels each having an operating 40 
mode associated thereto; 

creating a fault classification model comprised of a plu- 
rality of fault classification submodels each having an 
operating mode associated thereto; 
acquiring a set of observed signal data values from an 
asset; 

determining at least one operating mode of the asset for 
the set of observed signal data values; 
selecting at least one fault detection submodel from the 50 
fault detection model as a function of at least the one 
determined operating mode; 

determining at least one fault indication as a function of 
the observed signal data values; 
selecting at least one fault classification submodel from 55 
the fault classification model as a function of at least the 
one determined operating mode, and 
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using at least the one fault indication and at least the one 
selected fault classification submodel for classifying 
faults for performing asset surveillance. 

28. The method of claim 27 further including the step of 
creating a parameter estimation model comprised of a plu- 
rality of parameter estimation submodels each having an 
operating mode associated thereto. 

29. The method of claim 28 further including the step of 
selecting at least one parameter estimation submodel from 
the parameter estimation model as a function of at least the 
one determined operating mode. 

30. The method of claim 29 further including the step of 
calculating a set of estimated signal values from at least one 
selected parameter estimation submodel. 

31. The method of claim 30 wherein the step of deter- 
mining at least one fault indication as a function of the 
observed signal data values includes the step of determining 
at least one fault indication as a function of both the 
estimated signal values and the observed signal data values. 

32. An asset surveillance system, comprising in combi- 
nation: 

a parameter estimation model comprised of a plurality of 
parameter estimation submodels each having an oper- 
ating mode associated thereto; 

a fault detection model comprised of a plurality of fault 
detection submodels each having an operating mode 
associated thereto; 

a fault classification model comprised of a plurality of 
fault classification submodels each having an operating 
mode associated thereto; 

means for acquiring a set of observed signal data values 
from an asset; 

means for determining at least one operating mode of the 
asset for the set of observed signal data values; 

means for selecting at least one parameter estimation 
submodel from the parameter estimation model as a 
function of at least the one determined operating mode; 

means for calculating a set of estimated signal values 
from at least one selected parameter estimation sub- 
model; 

means for selecting at least one fault detection submodel 
from the fault classification model as a function of at 
least the one determined operating mode; 

means for determining at least one fault indication as a 
function of the estimated signal values and observed 
signal data values; 

means for selecting at least one fault classification sub- 
model from the fault classification model as a function 
of at least the one determined operating mode, and 

means for using at least the one fault indication and at 
least the one selected fault classification submodel for 
classifying faults for performing asset surveillance. 





